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Introduction to the 
Dover Edition 


It is now 2012 and again I am teaching Fractal Geometry and Dynamical Systems 
with Fractals Everywhere as the main text. Each year I teach the course differ- 
ently, always trying to make the subject new, fresh, immediate, and related to my 
research interests at the time. In 1985, when I first taught this material, in effect an 
introduction to iterated function systems, there was no suitable text. Now, as then, 
I think that iterated function systems (IFS) are a simple, beautiful, deep, unified, 
natural, mathematical framework that all undergraduates in the mathematical sci- 
ences and engineering should know. Via iterated function systems, a student meets 
and uses affine geometry, topology, measure theory, dynamical systems, and prob- 
ability. I note that the interplay of geometry and probability is readily understood, 
at an intuitive level, by all students who have done a year of calculus. 

IFS theory provides a direct route to the heart of fractal geometry; for example 
many problems in fractal geometry may be illustrated using IFS. They are excit- 
ing, easy, difficult, and an enduring source of fascinating structures and new ideas, 
as illustrated by recent work on fractal homeomorphisms — see the second set of 
color plates and videos on YouTube. They provide easy-to-state hard problems 
in pure mathematics: for example what is the fractal dimension of the attractor 
of an IFS of two affine maps in two dimensions? They yield surprisingly simple 
approaches to otherwise intractable problems in computer graphics and model- 
ling of natural objects: for example the collage theorem may be used to model 
Ediacaran rangemorphs. 

I teach the course in two streams, one for undergraduate honors mathematics 
students, and one for third year undergraduates in the sciences and engineering. 
Typically there are three lectures per week. One lecture per week is devoted to 
honors mathematics, is focused on proofs of theorems, and current research topics 
that relate to the material in this book. For example, when covering affine trans- 
formations, the honors lecture may concern the joint spectral radius of a set of 
linear maps. All students attend the other two lectures, which cover the main defi- 
nitions, theorems and geometrical intuitions. For example, such a pair of lectures 
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may include: the definition of an affine transformation in two dimensions, the 
fundamental theorem that such an invertible affine transformation is uniquely 
determined by its action on three non collinear points, and conditions under 
which an affine transformation is a contraction in some metric. There are also 
weekly tutorials, bi-weekly problem sets, and occasional computer lab meet- 
ings where students are bewitched by the chaos game algorithm, the collage 
theorem, fractal homeomorphisms, and the relationship between the logistic 
map and the Mandelbrot set. I believe that the interplay between simple com- 
puter programs that produce graphical output and the process of mathemati- 
cal conjecture and proof provide a valuable component to the education of 
students in many technical and mathematical areas. 

The core of the course involves Chapters II, III, [V and V. Chapter II cen- 
ters on metric spaces and topological concepts; it builds to the realization 
that the space of subsets of a compact metric space is itself a compact metric 
space and explores the Hausdorff metric. One of the themes in Chapter II 
is that, since fractals are subsets of spaces such as JR’, we need topology to 
describe important properties of sets and subsets, such as the properties of 
being open, compact, connected, perfect, and so on. Chapter III begins with 
a discussion of various types of transformations, for example affine trans- 
formations, Mobius transformations, and analytic transformations. Then the 
contraction mapping theorem is explained and explored. After this, iterated 
function systems are introduced; the contraction mapping theorem is used to 
show that each hyperbolic IFS has a unique attractor. Many important frac- 
tals are revealed to be attractors of iterated function systems. The chaos game 
algorithm, and the photocopy machine algorithm, are introduced as conve- 
nient ways to calculate the attractors of IFSs. Then the students learn how to 
find IFSs, whose attractors approximate given sets, with the aid of the collage 
theorem. Designing, calculating, and understanding IFS attractors is a core 
component of the course, and students are often delighted to understand how 
to design attractors to order. In many cases that I have known students find 
their own interesting new examples or applications. 

Chapter IV ties together IFS theory with some elementary ideas in dynami- 
cal systems. The relationship between the chaos game algorithm, on the one 
hand, and iteration of many-to-one dynamical systems, such as the logistic 
map, provides a bridge between fractal geometry and chaotic dynamics. The 
relationship between symbolic dynamics and IFS attractors plays a key role. 
In particular the action of the shift map on code space is related to the action 
of a chaotic dynamical system on the attractor of an IFS. 

Chapter V provides an introduction to fractal dimension. The honors lec- 
ture on this topic provides more material concerning the Hausdorff dimen- 
sion, based on the book Fractal Geometry: Mathematical Foundations and 
Applications by Kenneth Falconer. All students are encouraged to try to 
determine, either experimentally or theoretically, a fractal dimension of some 


Introduction to the Dover Edition 


object of interest to them, either a mathematical object or a notion such as 
the coastline of Australia. 

The second half of the course involves a sequence of lectures on special 
topics: fractal interpolation, Julia sets and the Mandelbrot set (with iPad 
demonstrations), IFSs with probabilities and measure theory attractors, 
fractal transformations and homeomorphisms, fractals on hyperspaces and 
superfractals, the fundamental theorem of fractal geometry, fractal image 
compression, analysis of Newton’s method, masked iterated function sys- 
tems, and so on. These topics are treated as applications of IFS theory and 
they are taught independently of one another. Students are encouraged to do 
a project that involves going into depth on one of these topics. 

Please visit www.superfractals.com for additional materials and a list of 
corrections. 


Michael Fielding Barnsley 
Canberra, 2012 
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Much has changed in the world of fractals, computer graphics and modern mathemat- 
ics since the first edition of Fractals Everywhere appeared. The company Iterated 
Systems, Inc., founded by Michael Barnsley and Alan Sloan, is now competing in the 
image compression field with both hardware and software products that use fractal 
geometry to compress images. Indeed, there is now a plethora of texts on subjects like 
fractals and chaos, and these terms are rapidly becoming "household words." 

The fundamental approach to fractal geometry through iterated function systems 
remains sound as an introduction to the subject. This edition of Fractals Everywhere 
leaves this approach largely as it stands. One still needs a grounding in concepts in 
metric space theory and eventually (see Chapter IX) measure theory to get a working 
understanding of the subject. However, there have been several additions to help ease 
and broaden the reader's development. 

Primary to these is the addition of answers to the mathematical problems. These 
were done largely by starting at one end of the book writing the answers until the other 
cover was reached. Most of the answers found in the key have been worked over at 
least twice, in hopes improving the accuracy of the key. Every effort has been make 
to rely solely on the material presented ahead of each problem, although in a few of 
the harder problems some concepts have been introduced in the answers themselves. 
These are not considered necessary to the development of the main thread of the text; 
however, if the reader finds some areas of mathematics touched on in looking at the 
presented solutions which extend the feeling for the subject, the key has served its 
purpose. 

In addition the the answer key, there have been some other changes as well. In 
Chapter III, section 11, the main theorem has been qualified. The reader with more 
mathematical background will recognize that the additional Lipshitz condition 
satisfies the need for equicontinuity in Theorem 11.1. This is not the only way to 
satisfy it, just the clearest in terms of the presumed mathematical background. 


XVI 


Foreword to the Second Edition 


There have been problems added to several chapters to develop the idea of 
Cartesian products of code spaces. This was done because it helps bridge the gap 
between IFS theory and the reversible systems found in physical chaos, and because 
it presents an interesting way of looking the the Random Iteration Algorithm in 
Chapter IX. The thread of these problems begins in Chapter II, leads up to the baker's 
transformation in Chapter IV, and is completed as an example in Chapter IX. 
Additional problems were added in Chapter III to develop some basic properties of 
eigenvalues and eigenvectors, which can be useful in examining dynamics both from 
the point of view described in the text, and elsewhere. It is hoped that with these 
additional tools those readers whose goals are application-oriented will come away 
with more at their disposal, while the text itself will retain its readable style. 

I would like to thank Lyman Hurd for many useful discussions about the 
topological nature of nonempty compact sets, and John Elton for his patience while 
I ran many of my new examples and problems past him to check them and to check 
the "excitement level” of the additional material. 


Hawley Rising 


It seems now that deterministic fractal geometry is racing ahead into the serious 
engineering phase. Commercial applications have emerged in the areas of image 
compression, video compression, computer graphics, and education. This is good 
because it authenticates once again the importance of the work of mathematicians. 
However, sometimes mathematicians lose interest in wonderful areas once scientists 
and engineers seem to have the subject under control. But there is so much more 
mathematics to be done. What is a useful metric for studying the contractivity of the 
vector recurrent IFS of affine maps in R?? What is the information content of a 
picture? Measures, pictures, dreams, chaos, flowers and information theory—the 
hours of the days keep rushing by: do not let the beauty of all these things pass us by too. 


Michael Fielding Barnsley 
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Fractal geometry will make you see everything differently. There is danger in read- 
ing further. You risk the loss of your childhood vision of clouds, forests, galaxies, 
leaves, feathers, flowers, rocks, mountains, torrents of water, carpets, bricks, and 
much else besides. Never again will your interpretation of these things be quite the 
same. 

The observation by Mandelbrot [Mandelbrot 1982] of the existence of a “Geom- 
etry of Nature” has led us to think in a new scientific way about the edges of clouds, 
the profiles of the tops of forests on the horizon, and the intricate moving arrange- 
ment of the feathers on the wings of a bird as it flies. Geometry is concerned with 
making our spatial intuitions objective. Classical geometry provides a first approx- 
imation to the structure of physical objects; it is the language that we use to com- 
municate the designs of technological products and, very approximately, the forms 
of natural creations. Fractal geometry is an extension of classical geometry. It can 
be used to make precise models of physical structures from ferns to galaxies. Fractal 
geometry is a new language. Once you can speak it, you can describe the shape of a 
cloud as precisely as an architect can describe a house. 

This book is based on a course called “Fractal Geometry,” which has been taught 
in the School of Mathematics at the Georgia Institute of Technology for two years. 
The ‘course is open to all students who have completed two years of calculus. It 
attracts both undergraduate and graduate students from many disciplines, includ- 
ing mathematics, biology, chemistry, physics, psychology, mechanical engineering, 
electrical engineering, aerospace engineering, computer science, and geophysical 
science. The delight of the students with the course is reflected in the fact there is 
now a second course, entitled “Fractal Measure Theory.” The courses provide a com- 
pelling vehicle for teaching beautiful mathematics to a wide range of students. 

Here is how the course in Fractal Geometry is taught. The core is Chapter II, 
Chapter III, sections 1-5 of Chapter IV, and sections 1-3 of Chapter V. This is 
followed by a collection of delightful special topics, chosen from Chapters VI, VII, 
and VIII. The course is taught in 30 one-hour lectures. 


2 


Chapter | 


Introduction 


Chapter II introduces the basic topological ideas that are needed to describe sub- 
sets to spaces such as R*. The framework is that of metric spaces; this is adopted 
because metric spaces are both rigorously and intuitively accessible, yet full of 
suprises. They provide a suitable setting for fractal geometry. The concepts intro- 
duced include openness, closedness, compactness, convergence, completeness, con- 
nectedness, and equivalence of metric spaces. An important theme concerns proper- 
ties that are preserved under equivalent metrics. Chapter IT concludes by presenting 
the most exciting idea: a metric space, denoted 1, whose elements are the nonempty 
compact subsets of a metric space. Under the right conditions this space is complete, 
sequences converge, and fractals can be found! 

Chapter III deals with transformations on metric spaces. First, the goal is to de- 
velop intuition and practical experience with the actions of elementary transforma- 
tions on subsets of spaces. Particular attention is devoted to affine transformations 
and Mobius transformations in R*. Then the contraction mapping principle is re- 
vealed, followed by the construction of contraction mappings on 7. Fractals are 
discovered as the fixed points of certain set maps. We learn how fractals are gener- 
ated by the application of “simple” transformations on “simple” spaces, and yet they 
are geometrically complicated. We explain what an iterated function system (IFS) 
is, and how it can define a fractal. Iterated function systems provide a convenient 
framework for the description, classification, and communication of fractals. Two 
algorithms, the “Chaos Game” and the Deterministic Algorithm, for computing pic- 
tures of fractals are presented. Attention is then turned to the inverse problem: given 
a compact subset of R’, fractal, how do you go about finding a fractal approximation 
to it? Part of the answer is provided by the Collage Theorem. Finally, the thought of 
the wind blowing through a fractal tree leads to discovery of conditions under which 
fractals depend continuously on the parameters that define them. 

Chapter IV is devoted to dynamics on fractals. The idea of addresses of points on 
certain fractals is developed. In particular, the reader learns about the metric space to 
which addresses belong. Nearby addresses correspond to nearby points on the frac- 
tal. This observation is made precise by the construction of a continuous transforma- 
tion from the space of addresses to the fractal. Then dynamical systems on metric 
spaces are introduced. The ideas of orbits, repulsive cycles, and equivalent dynam- 
ical systems are described. The concept of the shift dynamical system associated 
with an IFS is introduced and explored. This is a visual and simple idea in which the 
author and the reader are led to wonder about the complexity and beauty of the avail- 
able orbits. The equivalence of this dynamical system with a corresponding system 
on the space of addresses is established. This equivalence takes no account of the ge- 
ometrical complexity of the dance of the orbit on the fractal. The chapter then moves 
towards its conclusion, the definition of a chaotic dynamical system and the realiza- 
tion that “most” orbits of the shift dynamical system on a fractal are chaotic. To this 
end, two simple and delightful ideas are shown to the reader. The Shadow Theorem 


illustrates how apparently random orbits may actually be the “shadows” of determin- 
istic motions in higher-dimensional spaces. The Shadowing Theorem demonstrates 
how a rottenly inaccurate orbit may be trailed by a precise orbit, which clings like a 
secret agent. These ideas are used to make an explanation of why the “Chaos Game” 
computes fractals. 

Chapter V introduces the concept of fractal dimension. The fractal dimension of 
a set is a number that tells how densely the set occupies the metric space in which it 
lies. It is invariant under various stretchings and squeezings of the underlying space. 
This makes the fractal dimension meaningful as an experimental observable; it pos- 
sesses a certain robustness and is independent of the measurement units. Various 
theoretical properties of the fractal dimension, including some explicit formulas, 
are developed. Then the reader is shown how to calculate the fractal dimension of 
real-world data, and an application to a turbulent jet exhaust is described. Lastly the 
Hausdorff-Besicovitch dimension is introduced. This is another number that can be 
associated with a set. It is more robust and less practical than the fractal dimension. 
Some mathematicians love it; most experimentalists hate it; and we are intrigued. 

Chapter VI is devoted to fractal interpolation. The aim of the chapter is to teach 
the student practical skill in using a new technology for making complicated curves 
and fitting experimental data. It is shown how geometrically complex graphs of con- 
tinuous functions can be constructed to pass through specified data points. The func- 
tions are represented by succinct formulas. The main existence theorems and com- 
putational algorithms are provided. The functions are known as fractal interpolation 
functions. It is explained how they can be readily computed, stored, manipulated 
and communicated. “Hidden variable” fractal interpolation functions are introduced 
and illustrated; they are defined by the shadows of the graphs of three-dimensional 
fractal paths. These geometrical ideas are extended to introduce space-filling curves. 

Chapter VII gives an introduction to Julia sets, which are deterministic fractals 
that arise from the iteration of analytic functions. The objective is to show the reader 
how to understand these fractals, using the ideas of Chapters III and IV. In so do- 
ing we have the pleasure of explaining and illustrating the Escape Time Algorithm. 
This algorithm is a means for computergraphical experimentation on dynamical sys- 
tems that act on two-dimensional spaces. It provides illumination and coloration, a 
seachlight to probe dynamical systems for fractal structures and regions of chaos. 
The algorithm relies on the existence of “repelling sets” for continuous transforma- 
tions which map open sets to open sets. The applications of Julia sets to biological 
modelling and to understanding Newton’s method are considered. 

Chapter VIII is concerned with how to make maps of certain spaces, known 
as parameter spaces, where every point in the space corresponds to a fractal. The 
fractals depend “smoothly” on the location in the parameter space. How can one 
make a picture that provides useful information about what kinds of fractals are 
located where? If both the space in which the fractals lie and the parameter space 
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are two-dimensional, the parameter space can sometimes be “painted” to reveal an 
associated Mandelbrot set. Mandelbrot sets are defined, and three different examples 
are explored, including the one discovered by Mandelbrot. A computergraphical 
technique for producing images of these sets is described. Some basic theorems are 
proved. 

Chapter IX is an introduction to measures on fractals and to measures in general. 
The chapter is an outline that can be used by a professor as the basis of a course in 
fractal measure theory. It can also be used in a standard measure theory course as 
a source of applications and examples. One goal is to demonstrate that measure the- 
ory is a workaday tool in science and engineering. Models for real-world images can 
be made using measures. The variations in color and brightness, and the complex 
textures in a color picture, can be successfully modelled by measures that can be 
written down explicitly in terms of succinct “formulas.” These measures are desir- 
able for image engineering applications, and have a number of advantages over non- 
negative “density” functions. Section 1 provides an intuitive description of measures 
and motivates the rest of the chapter. The context is that of Borel measures on com- 
pact metric spaces. Fields, sigma-fields, and measures are defined. Carathéodory’s 
extension theorem is introduced and used to explain what a Borel measure is. Then 
the integral of a continuous real-valued function, with respect to a measure, is de- 
fined. The reader learns to evaluate some integrals. Next the space P of normalized 
Borel measures on a compact metric space is defined. With an appropriate metric, 
P becomes a compact metric space. Succintly defined contraction mappings on this 
space lead to measures that live on fractals. Integrals with respect to these measures 
can be evaluated with the aid of Elton’s ergodic theorem. The book ends with a de- 
scription of the application of these measures to computer graphics. 

This book teaches the tools, methods, and theory of deterministic geometry. It 
is useful for describing specific objects and structures. Models are represented by 
succinct ‘‘formulas.” Once the formula is known the model can be reproduced. We 
do not consider statistical geometry. The latter aims at discovering general statistical 
laws that govern families of similar-looking structures, such as a// cumulus clouds, 
all maple leaves, or a/l mountains. 

In deterministic geometry, structures are defined, communicated, and analyzed, 
with the aid of elementary transformations such as affine transformations, scalings, 
rotations, and congruences. A fractal set generally contains infinitely many points 
whose organization is so complicated that it is not possible to describe the set by 
specifying directly where each point in it lies. Instead, the set may be defined by 
“the relations between the pieces.” It is rather like describing the solar system by 
quoting the law of gravitation and stating the initial conditions. Everything follows 
from that. It appears always to be better to describe in terms of relationships. 
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Metric Spaces; 
Equivalent Spaces; 
Classification of Subsets; 
and the Space of Fractals 


1 Spaces 


In fractal geometry we are concerned with the structure of subsets of various very 
simple “geometrical” spaces. Such a space is denoted by X. It is the space on which 
we think of drawing our fractals; it is the place where fractals live. What is a fractal? 
For us, for now, it is just a subset of a space. Whereas the space is simple, the fractal 
subset may be geometrically complicated. 


Definition 1.1 A space X is a set. The points of the space are the elements of 
the set. 


Although this definition does not say it, the nomenclature “space” implies that 
there is some structure to the set, some sense of which points are close to which. We 
give some examples to show the sort of thing this may mean. Throughout this text R 
denotes the set of real numbers, and “ce” means “belongs to.” 


Examples 
1.1. X=R. Each “point” x € X is a real number, or a dot on a line. 


1.2. X= C[O, 1], the set of continuous functions that take the real closed interval 
[0, 1] = {x € R: 0 <x < 1} into the real line R. A “point” f € X is a function 


cts. 


f : (0, 1] > R. f may be represented by its graph. 
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Figure Il.1. A point x 
in R. 


Figure II.2. A point f 
in the space of continuous 
functions on [0, 1]. 


Notice that here f € X is not a point on the x-axis; it is the whole function. 
A continuous function on an interval is characterized by the fact that its graph is 
unbroken; as a picture it contains no rips or tears; it can be drawn without removing 
the pencil from the paper. 


1.3. X = R?’, the Euclidean plane, the coordinate plane of calculus. Any pair of real 
numbers x;, x2 € R determines a single point in R*. A point x € X is represented in 
several equivalent ways: 


x = (4), 22) = « = a point in a figure such as Figure II.3. 


The spaces in examples 1.1, 1.2, and 1.3 are each linear spaces: there is an 
obviously defined way, in each case, of adding two points in the space to obtain a 
new one in the same space. In 1.1 if x and y € R, then x + y is also in R; in 1.2 we 
define (f + g)(x) = f(x) + g(x); and in 1.3 we define 


x1 y1 x) ad 
xty= + = 
=] “ bade 


Similarly, in each of the above examples, we can multiply members of X by a scalar, 
that is, by a real number a € R. For example, in 1.2 (a@f)(x) = af (x) for any a € R, 
and af € C[O, 1] whenever f € C[O, 1]. Example 1.1 is a one-dimensional linear 
space; 1.2 is an oo-dimensional linear space (can you think why the dimension is 
infinite?); and 1.3 is a two-dimensional linear space. A linear space is also called a 
vector space. The scalars may be complex numbers instead of real numbers. 


1.4. The complex plane, X = C, where any point x € X is represented 
Xx =X, +1X), where i = / —1, 


for some pair of real numbers x;, x2 € R. Any pair of numbers x;, x2 € R determines 
a point of C. It is obvious that C is essentially the same as R?, but there is an implied 
distinction. In C we can multiply two points x, y and obtain a new point in C. 
Specifically, we define 


Xs y= (xX +1x2)(n1 + Ly2) = (X11 — X22) + i(x2y1 + X12) 


lg A= r the Riemann sphere. Formally, C=Cu {oo}; that is, all the points of 
C together with the “point at infinity.’ Here is a way of constructing and thinking 
about C. Place a sphere on the plane C, with the South Pole on the origin, and the 
North Pole N vertically above it. 

To a given point x € C we associate a point x’ on the sphere by constructing 
the straight line from N to x and marking where this line intersects the sphere. 
This associates a unique point x’ = h(x) with each point x € C. The transformation 
h:C-— sphere is clearly continuous in the sense that nearby points go to nearby 
points. Points farther and farther away from 0 in the plane C end up closer and 
closer to N. C consists of the completion of the range of h by including N on the 
sphere: The “point at infinity (00)” can be thought of as a giant circle, infinitely far 
out in C, whose image under h is N. It is easier to think of C being the whole of the 
sphere, rather than as the plane together with oo. It is of interest that h : C — sphere 


Figure 11.3. A point x 
in the space R?. 
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Figure Il.4. Construc- 
tion of a geometrical rep- 
resentation for the Rie- 
mann sphere. N is the 
North Pole and corre- 
sponds to the “point at 
infinity.” 


is conformal: it preserves angles. The image under h of a triangle in the plane is a 
curvaceous triangle on the sphere. 

Although the sides of the triangle on the sphere are curvaceous they meet in 
well-defined angles, as one can visualize by imagining the globe to be magnified 
enormously. The angles of the curvaceous triangle are the same as the corresponding 
angles of the triangle in the plane. 


Examples & Exercises 


1.6. X= 2, the code space on N symbols. Usually the symbols are the integers 
{0,1,2,..., MN — 1}. A typical point in X is a semi-infinite word such as 


XSL te Oi 2 isin 1 SO... 


There are infinitely many symbols in this sequence. In general, for a given element 
x € X, we can write 


Figure 11.5. A triangle 
in the plane corresponds 
to a curvaceous triangle 

on the sphere. 


XK = Xj RaKaXGXsXGXXS. © <5 where each x; € {0, 1,2,..., N — 1}. 


There are many names attached to this space because of its importance in a variety 
of branches of mathematics and physics. When each symbol is intended to represent 
a random choice from N possibilities, each point in this space represents a particular 
sequence of events, from a set of N possible events. In this case, the space is some- 
times called the space of Bernoulli trials. When there are several code spaces being 
referred to, it is customary to write the code space on N symbols as Ly. 


1.7. A few other favorite spaces are defined as follows. 
(a) A disk in the plane with center at the origin and with finite radius R > 0: 
@ = (x eR: x7 4 x? < R}. 
(b) A “filled” square: 
m= {xe R?:0<x, <1,0<x <1). 
(c) An interval: 
[a,b] = {x €R:a <x <b}, where a and b are real numbers with.a < b. 
(d) Body space: 
= {x € R°: coordinate points implied by a cadaver frozen in R°}. 
(e) Sierpinski space 


A = {x € R’:x isa point ona certain fixed Sierpinski triangle}. 


Sierpinski triangles occur often in this text. See, for example, Figure IV.94. 


1.8. Show that the examples in 1.5, 1.6, and 1.7 are not vector spaces, at least if 
addition and multiplication by reals are defined in the usual way. 


1.9. The notation A C X means A is a subset of X; that is, if x € A then x € X, or 
x € A implies x € X. The symbol @ means the empty set. It is defined to be the set 
such that the statement “x € @” is always false. We use the notation {x} to denote the 
set consisting of a single point x € X. Show that if x € X, then {x} is a subset of X. 


1.10. Any set of points makes a space, if we care to define it as such. The points are 
what we choose them to be. Why, do you think, have the spaces defined above been 
picked out as important? Describe other spaces that are equally important. 


1.11. Let X; and X» be spaces. These can be used to make a new space denoted 
X; X Xp, called the Cartesian product of X; and X>. A point in X; x X> is repre- 
sented by the ordered pair (x), x2), where x, € X; and x2 € X». For example, R? is 
the Cartesian product of R and R. 


1.12. As another example of a Cartesian product let 


A 1K,.9) Xs, YE SS EZ, 
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where X is the code space on N symbols. This has an interpretation in terms of 
the random choices mentioned in exercise 1.6. We call y the past and x the future. 
Then each element of the space represents a sequence of “coin tosses” (the coins are 
really more like N-sided dice); y represents the tosses that have already happened, 
beginning with the latest one, and x represents the tosses to come (beginning with 
the next one). If we rewrite the point (x, y) with a dot marking the “present,” 


~ ++ Y3¥2¥1 » X1X2X3... 


then the act of moving the dot to the right moves one future coin toss to a past coin 
toss: the obvious interpretation is that it represents flipping the coin. Moving the dot 
is called a shift, and the space is called the space of shifts on N symbols. It is also 
denoted 5, whether it is this space or code space that is being referred to is usually 
clear from context. In this book © will always be code space unless specifically 
mentioned. 


ne EE 


2 Metric Spaces 


We use the notation “W” to mean “for all.” We also introduce the notation A \ B to 
mean the set A “take away” the set B. That is, A \ B= {x € A: x ¢ B}. Weuse “=>” 
to mean “implies.” 


Definition 2.1 A metric space (X, d) is a space X together with a real-valued 
function d :X x X > R, which measures the distance between pairs of points x and 
y in X. We require that d obeys the following axioms: 


(1) d(x, y)=d(y,x) Vx, yeX 

(2) O< d(x, y) <0 Wx,yEX,xFy 

(3) d(x,x)=0 Vx eX 

(4) d(x, y) <d(x,z) +d(z, y) Vx, y,z EX. 


Such a function d is called a metric. 


The concept of shortest paths between points in a space, geodesics, is depen- 
dent on the metric. The metric may determine a geodesic structure of the space. 
Geodesics on a sphere are great circles; in the plane with the Euclidean metric they 
are straight lines. 


Examples & Exercises 


2.1. Show that the following are all metrics in the space X = R: 


(a) d(x, y) =|x — y| (Euclidean metric) 
(b) d(x, y)=2-|x—y| 
(c) d(x, y) = |x? — y’| 
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Figure II.6. (The angle 
6, and the distances r;, r2 
used to construct a metric 
on the punctured plane.) 
Acute angle subtended by 
two straight lines. 


2.2. Show that the following are metrics in the space X = R?: 


(a) d(x, y) = / (x1 — y1)? + (42 — y2)* (Euclidean metric) 
(b) d(x, y) = [x1 — yi| + |x2 — y2| (Manhattan metric) 


Why is the name “Manhattan” used in connection with (b)? 
2.3. Show that d(x, y) = |xy| does not define a metric in R. 
2.4. Let R* \ {O} denote the punctured plane. Define d(x, y) as follows: 
d(x, y) = |r — r2| + |4I, 


where r; = Euclidean distance from x to O, rz = Euclidean distance from y to 
O, O is the origin, and @ is the smallest angle subtended by the two straight lines 
connecting x and y to the origin. Show that d is a metric. 


2.5. On the code space & define 
Ixi — yil 
i (N + 1)! 


i= 


A(X, y) = d(X1xX2X3..., W1y2y3...) = 


Show that every pair of points in & is a finite distance apart. That is, d is indeed 
a function that takes & x & into R. Verify that (2, d) is a metric space. Try to 
envisage a possible geometry for &. (Do not confuse the possible meanings of the 
symbol 2; from its context it should be clear when it refers to code space and when 
it refers to summation.) 


*2.6. Define X = {(x, y): x, y € XZ}, the space of shifts on N symbols as in exer- 
cise 1.12. We can define a Euclidean distance by treating each coordinate as a base 
N + 1 number between 0 and 1. That is, we make the distance between (x, y) and 


(u, v) equal to 
oo 2 oo 2 
oS Xj — Uj 4 Yi — Vi 
( (N + “) (5. (N + ‘) 
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Show that this is indeed a metric space. 


2.7. nX =**. defined (x, y) to be the Euclidean length of the shortest path lying 


Ad 
entirely within X which connects x and y. Show that this is a metric. Discuss the 
utility of this metric in anatomy. The distance from a toenail to a fingertip does not 
depend much on the configuration of the body, whereas the usual spatial distance 


would. 


2.8. Invent a function d:m x #— R which is not a metric. Define a metric for 
the space ,namely an annulus, which makes it seem like the curved wall of a 


cylinder: | 


* 2.9. Show that a metric on X = C is defined by shortest great circle distances on 
the sphere. Compare the distances from 0, and from 1 + i, to oo. 


Definition 2.2 Two metrics d, and d, on a space X are equivalent if there exist 
constants 0 < c, < c2 < 00 such that 


cidi(x, y) < a(x, y) <codi(x,y), Wx, y) EX xX. 


Examples & Exercises 


2.10. Definition 2.2 looks unsymmetrical; it does not appear to make the same 
requirements on d; as it does on d2. Show that this is an illusion by establishing 
that if the definition holds then there are constants 0 < e; < e) < oo so that 


€d2(x, y) < d\(x, y) < e2d2(x, y), Vix,y)EXx X. 


2.11. Are the Manhattan and Euclidean metrics equivalent on m C R?? What about 
on R?? 


2.12. Show that the metric in exercise 2.4 is not equivalent to the Euclidean metric 
on @ \{0}. 

One notion underlying the concept of equivalent metrics is that any pair of equiv- 
alent metrics gives the same notion of which points are close together and which 
are far apart. It is as though there were a standard way for boundedly deforming the 
space, whereby distances are determined both before and after deformation. 

For example, consider a pair of points x and y inm C R?. Let the Euclidean dis- 
tance between these points be d\(x, y). Think of a thin rubber sheet lying over m. 
This sheet is stretched in some repeatable fashion, carrying copies of the points x 
and y to new locations, as illustrated in Figure II.7. The Euclidean distance between 
these moved points is called d2(x, y). The condition of equivalence is the require- 
ment that there is no extreme (infinite) stretching or compression of the space. 

This leads us to the idea of equivalent metric spaces: 


Definition 2.3 Two metric spaces (X;, d,) and (X, d2) are equivalent if there 
is a function h: X, — X> that is one-to-one and onto (i.e., it is invertible), such that 
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i eet si eles icles <n ates pasaneuanisaaeeiowcnacsial a aewiosd ena cndi as i eee 
Figure 11.7. A thin rub- 
ber sheet lies over the @ in 
the plane and is stretched. 
The Euclidean distances 
between points are de- 
termined before and af- 
ter deformation, yielding 
two metrics. These met- 
rics may be equivalent if 
the deformation leads to 
no rips, tears, or infinite 
stretching. 


Figure 11.8. This pic- 
ture suggests two metric 
7 spaces X, and X, that have 
; the same topology, but that 
ZZ» are not metrically equiv- 
alent: their “geometries” 
are deeply different. 


Point at the corner is missing. sae Bak 


The point which is not 
at the corner is pulled 
to infinity. 


X; and X, are homeomorphic : they have the same topology. 


But they are not equivalent : their geometries are deeply different. 


the metric d, on X, defined by 
d(x, y) =d,(h(x), h(y)), Vx, yeX 
is equivalent to d\. 


One can think of Definition 2.3 as requiring that X, and X) are related to one 
another by a bounded deformation, and nowhere is there an arbitrarily large com- 
pression or stretching; also, there is no overlapping, folding, or ripping. 
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Definition 2.4 A function f :X; — X2 from a metric space (Xj, dj) into a 
metric space (X2, d2) is continuous ff, for each € > O and x € Xi, there isad>0 
so that 


d(x, y) <5 > a(f (x), f(y) <<. 


If f is also one-to-one and onto, and thus invertible, and if also the inverse f | Off 
is continuous, then we say that f is a homeomorphism between X, and Xp. In such 
a case we say that X, and Xz are homeomorphic. 


The assertion that two spaces are equivalent metrically is much stronger than the 
statement that they are homeomorphic: to be equivalent there must be a bounded re- 
lationship between € and 5 independent of x. Homeomorphism is the equivalence re- 
lationship for topological properties; two spaces that are homeomorphic are identical 
topological spaces. Two metrics d; and d, on a given space X are identical topolog- 
ically (define the same topological space) if the identity map t : (X, d;) > (XK, do) 
given by (x) =x is a homeomorphism. 


Examples & Exercises 

2.13. Let X; =[1, 2] and X2 = [0, 1]. Let d, denote the Euclidean and let d(x, y) 
=2-|x — y| in X2. Show that (X;, d)) and (Xz, d2) are equivalent metric spaces. 
2.14. Show that (m, Euclidean) and (@, Manhattan) are equivalent metric spaces. 


2.15. Show that (C, Euclidean) and (R*, Manhattan) are equivalent metric spaces. 


2.16. Define two different metrics on the space X = (0, 1] ={x €eR:0<x<1} 
by 


1 1 
d\(x,y)=|x—y| and dQ(x,y)= = = - 


Show that (X, d;) and (X, d2) are not equivalent metric spaces. 


2.17. Figure II.9 suggests a subset (black) of (@, Euclidean). It also shows the space 
and set deformed by a metric equivalence. Discuss the properties of the image that 
would be invariant under (a) any metric equivalence, and (b) any homeomorphism. 
To what extent might one be able to “see” these invariances? Think about how much 
deformation an image can withstand while remaining recognizably the same image. 
Look at reflections of sets and images in the back of a shiny spoon. 


2.18. Show that if two metric spaces are metrically equivalent then there is a home- 
omorphism between them. 


* 2.19. Wecan define metrics on Z, our code space, by 


ore) 
Xi — Yi 
d(x, y) =) ae 


i=] 
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where k > 1. The most important choices for k are N + 1 and N, but other choices 
(mostly real numbers between these two) are useful. Show that the functions d, : 
& xX X& — R are metrics, and that these metrics are identical topologically. 


3 Cauchy Sequences, Limit Points, Closed Sets, Perfect Sets, and Complete 
Metric Spaces 


Fractal geometry is concerned with the description, classification, analysis and ob- 
servation of subsets of metric spaces (X, d). The metric spaces are usually, but not 
always, of an inherently “simple” geometrical character; the subsets are typically 
geometrically “complicated.” There are a number of general properties of subsets of 
metric spaces, which occur over and over again, which are very basic, and which 
form part of the vocabulary for describing fractal sets and other subsets of metric 
spaces. Some of these properties, such as openness and closedness, which we are 
going to introduce, are of a topological character. That is to say, they are invariant 
under homeomorphism. 


Figure 11.9. What fea- 
tures of the set (black) are 
invariant under a metric 
equivalence transforma- 
tion? Two sets that are 
metrically equivalent to 
(a) are shown in (b) and 


(c). 


(a) 
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Figure 11.9. (b) 


Figure 11.9. (c) 
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Fo. us what is important, however, is that there is another class of properties that 
are invariant under metric space equivalence. These include openness, closedness, 
boundedness, completeness, compactness, and perfection; these properties are intro- 
duced in this and the next section. Later we will discover another such property, the 
fractal dimension of a set. If a subset of a metric space has one of these properties, 
and the space is deformed with bounded distortion, then the corresponding subsets 
in the deformed space still have that same property. 

We are also about another business in this section. In our search for fractals we 
are always going to look in a certain type of metric space known as “complete.” We 
need to understand this concept. 


Definition 3.1 A sequence {x,}°°, of points in a metric space (X, d) is called a 
Cauchy sequence if, for any given number € > 0, there is an integer N > 0 so that 


d(Xn,Xm) <€ for alln,m> N. 


In other words, the further along the sequence one goes, the closer together be- 
come the points in the sequence. Mentally one pictures something like the image in 
Figure II.10. 

However, just because a sequence of points moves closer together as one goes 
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Xqe 
magnify 
Xie Xoe magnify Kee to revea 
Xe° s~I1to reveal /X- 7. X10 
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9° 13° 


Infinite sequence of 
points in X 


Figure 11.10. Image 
representing succes- 

sive magnifications on 

a Cauchy sequence, an 
infinite sequence of points 
in X. Just because the 
points are getting closer 
and closer together as one 
looks in at higher mag- 


nification does not mean 
that there is a point x to 
which the sequence is 
converging! 


along the sequence, we must not infer that they are approaching a point. Perhaps 
they are trying to approach a point that is not there? 

Definition 3.2 A sequence {x,}°°, of points in a metric space (X, d) is said to 
converge to a point x € X if, for any given number é€ > 0, there is an integer N > 0 
so that 


d(Xn,X) <€ for alln > N. 


In this case the point x € X, to which the sequence converges, is called the limit of 


the sequence, and we use the notation 


X= Im 2X 
n—0oO 


The limit x of a convergent sequence {x,}"°., has this property: let 
B(x,e)={yeX: d(x, y) <e€} 


denote a closed ball of radius € > 0 centered at x, as illustrated in Figure II.11. 

Any such ball centered at x contains all of the points x, after some index N, 
where N typically becomes larger and larger as « becomes smaller and smaller. See 
Figure I1.12. 


Theorem 3.1 [fa sequence of points {x,}°°, in a metric space (X, d) converges 
to a point x € X, then {x,}"°_, is a Cauchy sequence. 


; oor 


(c) 


(a) 


(b) 


(d) 


Figure Il.11. Uncele- 
brated small ball B(x, €) 
with its center at x and 
radius €. Beware! Balls 
do not usually look like 
balls. It depends on the 
metric and on the space. 
Balls (a)-(c) represent 
balls (marked in black) in 
spaces X that are subsets 
of R*, with the Euclidean 
metric. In (a) X has a 
ragged boundary, viewed 
as a subset of R?. In (b) the 
point x € X is isolated. In 
(c) X is a curvaceous Sier- 
pinski triangle. The ball 
depicted in (d) is in R?, 
but the metric is d(x, y) = 
max{|x; — yi|, |x2 — yal}. 
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Figure 11.12. Magni- Sans ZOOM IN 
fying glass looking at a fo oe FOREVER 
magnifying glass near a : 

limit point. ore points of 


_MAGNIFYING 
GLASS 


Definition 3.3 A metric space (X,d) is complete if every Cauchy sequence 
{x,}°2, in X has a limit x € X. 


In other words, there actually exists, in the space, a point x to which the Cauchy 
sequence is converging. This point x is of course the limit of the sequence. If {x,}7° 
is a Cauchy sequence of points in X and if X is complete, then there is a point x € X 
such that, for each « > 0, B(x, €) contains x, for infinitely many integers n. 

We will sometimes use the notation {x,} in place of {x,}°C, and lim in place of 
lim,—>o0 when it is clear from the context what the domain of the index is. 


Examples & Exercises 


3.1. Prove that if {x,}°2, is a Cauchy sequence of points in X and if X is complete, 
then there is a point x € X such that, for each € > 0, B(x, €) contains x, for infinitely 
many integers n. 


%* 3.2. Show that (IR, Euclidean metric) is a complete metric space. 

3.3. Show that (R?, Euclidean metric) is a complete metric space. 

3.4. Show that (@, Euclidean metric) is a complete metric space. 

3.5. Show that (C, metric on sphere) is a complete metric space. 

3.6. Show that (x, code space) is a complete metric space. 

3.7. (C[0, 1], D) is a complete metric space, where the metric D is defined by 
D(f, g) =max{| f(s) — g(s)|: 5 € [0, 1]}. 

3.8. Let (X;, d,) and (X2, d)) be equivalent metric spaces. Suppose (X1, d;) is com- 

plete. Show that (Xz, d2) is complete. 


3.9. Show that there are many different “shortest paths” between most pairs of 
points in (@, Manhattan). 


3.10. Prove Theorem 3.1. 


3.11. Prove that any sequence in a metric space can have at most one limit. 


4 Compact Sets, Bounded Sets, Open Sets, and Boundaries 


Definition 3.4 Let S Cc X be a subset of a metric space (X, d). A point x € X is 
called a limit point of S if there is a sequence {x,}°°_, of points x, € S \ {x} such that 
WM nan Xe =F: | 

Definition 3.5 Let S Cc X be a subset of a metric space (X, d). The closure of S, 
denoted S, is defined to be § = S U {Limit points of S}. S is closed if it contains all 
of its limit points, that is, S = S. S is perfect if it is equal to the set of all its limit 
points. 


Exercises & Examples 


3.12. Take the sequence {x, = +}°, and show that it converges in the metric 
space ((0, 1], Euclidean) but not in the metric space ((0, 1], Euclidean). 


3.13. A metric space (X,d) consists of a single point X = {a}, together with a 
metric defined by d(a,a) = 0. Show that X contains a Cauchy sequence and the 
limit of the Cauchy sequence, but that it possesses no limit points. Hence show that 
X is closed and complete but not perfect. 

3.14. Show that the sequence {x, = n}r~_, has no limit in (IR, Euclidean), but that it 
does when the points are treated as belonging to (C, spherical). 

3.15. Show that if h : X; — X, makes the metric spaces (X;,d,) and (Xp, d>) 
metrically equivalent, then the statements “x € X, is a limit point of § C X,” and 
“h(x) € X is a limit point of h(S) C X,” are equivalent. Here we use the notation 
h(S) = {h(s):s € S}. 

3.16. Find all of the limit points of the set {x, = (4 + (—1)", 2 + (-1)”"):n= 
1, 2,3, ...} in the metric space (@ , Euclidean). 

3.17. Show that the subset S = {x = + :n=1,2,3,...} is closed in ((0, 1], Eu- 
clidean) . 

3.18. Show that 5 = [0, 1] is a perfect subset of (R, Euclidean). 

3.19. Show that S = {4+:n=1,2,3,...} U {0} is not a perfect subset of (R, Eu- 
clidean), but that 5 = S. 


3.20. Show that S = & is a perfect subset of (X, code space metric). 


3.21. Let S be a subset of a complete metric space (X, d). Then (S, d) is a metric 
space. Show that (S, d) is complete if, and only if, S is closed in X. 


4 Compact Sets, Bounded Sets, Open Sets, and Boundaries 


We continue the description of basic properties that are to be used to describe sets 
and subsets of metric spaces. Where are the fractals? What are they? They are ev- 
erywhere and soon you will be able to see them. Not just the pictures, which are 
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shadows of fractals, but in your mind’s eye you will find what and where they really 
are. 


Definition 4.1 Let S Cc X be a subset of a metric space (X,d). S is compact if 
every infinite sequence {x,,}°°, in S contains a subsequence having a limit in S. 


Definition 4.2 Let S Cc X be a subset of a metric space (X, d). S is bounded if 
there is a point a € X and anumber R > 0 so that 


d(a,x) < RVx € X. 


Definition 4.3 Let S Cc X be a subset of a metric space (X,d). S is totally 
bounded if, for each € > 0, there is a finite set of points {y, y2,..., Yn} C S such 
that whenever x € X, d(x, y;) < € for some y; € {y1, y2,-.-, Yn}. This set of points 
{y1, ¥2,---, Yn} is called an €-net. 


Theorem 4.1 Let (X,d) be a complete metric space. Let S Cc X. Then S is 
compact if and only if it is closed and totally bounded. 


Proof Suppose that S is closed and totally bounded. Let {x; € S} be an infinite 
sequence of points in S. Since S is totally bounded we can find a finite collection of 
closed balls of radius 1 such that S is contained in the union of these balls. By the 
Pigeon-Hole Principle (a huge number of pigeons laying eggs in two letter boxes > 
at least one letter box contains a huge number of angry pigeons), one of the balls, say 
B,, contains infinitely many of the points x,,. Choose Nj so that xy, € B;. It is easy to 
see that B,; M S is totally bounded. So we can cover B, M S by a finite set of balls of 
radius 1/2. By the Pigeon-Hole Principle, one of the balls, say Bz, contains infinitely 


-many of the points x,. Choose N2 so that xy, € B> and N> > N,. We continue in this 


fashion to construct a nested sequence of balls, 


B, > Bo > B3 D Ba D Bs D B65 D BF D B38 D BoD:::DB,D::: 


where B,, has radius el and a sequence of integers {N,,}"°., such that xy, € Bp. It 


is easy to see that {xy,}°°,, which is a subsequence of the original sequence {x,}, is 
a Cauchy sequence in S. Since S is closed, and X is a complete metric space, S is 
complete as well; see exercise 4.2. So {x,} converges to a point x in S. (Notice that 
x is exactly (\°-, B,.) Thus, S is compact. 

Conversely, suppose that S is compact. Let « > 0. Suppose that there does not 
exist an e-net for S. Then there is an infinite sequence of points {x, € S} with 
d(x;,x;) = € for alli 4 j. But this sequence must possess a convergent subsequence 
{xn,}. By Theorem 3.1 this sequence is a Cauchy sequence, and so we can find a pair 
of integers N; and Nj; with N; ~ N; so that d(xn,, Xn;) <eé. But d(Xn,; XN;) = €, SO 
we have a contradiction. Thus there does exist an €-net. This completes the proof. 


Definition 4.4 Let S c X be a subset of a metric space (X, d). S is open if for 
each x € S there is ane > 0 such that B(x,€)={y €X: d(x, y) <e} CS. 
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Figure 11.13. A trans- 
formation 6 between two 

_ metric spaces, establish- 
ing the equivalence of the 

ye spaces and carrying the 
closed set S onto a closed 


STILL set 0(S). 
CLOSED 


e(S) 


Examples & Exercises 


4.1. Show that if (X, d) is a metric space then X is closed. Give an example of a 
metric space that is closed but not complete. 


4.2. Let S be a closed subset of a complete metric space (X, d). Show that (S, d) 
is a complete metric space. 


4.3. Let (X;, d,) and (X2, d)) be equivalent metric spaces, and let a transformation 
0: X; — Xp» provide this equivalence. Let S C X; be closed. Show that 6(S) = 
{0(s) : s € S} is closed. This idea is illustrated in Figure II. 13. 


4.4. If (X, d) is a metric space, then X is open. 
Proof Let x € X. Clearly B(x, 1) C X. 


4.5. If (X,d) is a metric space, then “S C X is open” is the same as “X \ S is 
closed.” 


Proof Suppose “S c X is open.” Suppose {x,} is a sequence in X \ S with a limit 
x € X. We must show that x € X \ S. Assume that x € S. Then every ball B(x, €) 
with € > 0 contains a point x, € X \ S, which means that S is not open. This is 
a contradiction. The assumption is false. Therefore x € X \ S. Therefore “X \ S is 
closed.” 

Suppose “X \ S is closed.” Let x € S. We want to show there is a ball B(x, €) C S. 
Assume there is no ball B(x, €) C S. Then for every integer n = 1, 2,3,..., we can 
find a point x, € B(x, ) M (X \ S). Clearly {x,} is a sequence in X \ S, with limit 
x € X. Since X \ S is closed we conclude that x € X \ S. This contradicts x € S. 
The assumption that there is no ball B(x, ¢€) C S is false. Therefore there is a ball 
B(x, €) C S. Therefore “S is open.” 


4.6. Every bounded subset S of (IR*, Euclidean) has the Bolzano-Weierstrass prop- 
erty: “Every infinite sequence {x,}°° , of points of S contains a subsequence which 
is a Cauchy sequence.” The proof is suggested by the picture in Figure II.14. 

We deduce that every closed bounded subset of (IR*, Euclidean) is compact. In 
particular, every metric space of the form (closed bounded subset of R?, Euclidean) 
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F ig u re | i : 1 4 ; Demon- a Fe" : geen Hae a ate 


stration of the Bolzano- | CONTAINS 


Weierstrass Theorem. INFINITELY | 
(Government warning: SO ONE N 


OF THE 


This is not a proof.) 


CONTAINS 
INFINITELY 
MANY X,/'S 
SO ONE 


CONTAINS 
INFINITELY 
MANY X,)'S 


ZOOM ( MAGNIFY ): 


AND SO ON FOREVER 


is a complete metric space. Show that we can make a rigorous proof by using Theo- 
rem 4.1. Begin by proving that any bounded subset of R” is totally bounded. 


4.7. Let (X, d) be a metric space. Let f : X — X be continuous. Let A be a compact 
nonempty subset of X. Show that f(A) is a compact nonempty subset of X. (This 
result is proved later as Lemma 7.2 of Chapter III.) 


4.8. Let S Cc (Xj, d;) be open, and let (X2, d) be a metric space equivalent to 
(X,, d,), the equivalence being provided by a function h : X; — X. Show that h(S) 
is an open subset of Xp. 


4.9. Let (X,d) be a metric space. Let C C X be a compact subset of X. Let {C, : 
n = 1,2,3,...} be a set of open subsets of X such that “x € C” implies “x € C, 
for some n.” {C,,} is called a countable open cover of C. Show that there is a finite 
integer N so that “x € C” implies “x € C, for some integer n < N.” 


Proof Assume that an integer N does not exist such that “x € C” implies “x € C,, 
for some n < N.” Then for each N we can find 
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N 
xy €C\(JG. 


n=1 
Since {xy}%7_, is in C it possesses a subsequence with a limit y € C. Clearly y does 
not belong to any of the subsets C,,. Hence “y € C” does not imply “y € C,, for some 
integer n.” We have a contradiction. This completes the proof. 


The following even stronger statement is true. Let (X, d) be a metric space. Let 
C Cc X be compact. Let {C; :i € J} denote any collection of open sets such that 
whenever x € C, it is true that x € C; for some index i € 7. Then there is a finite sub- 
collection, say {C, C2, ..., C,} such that C C J, C;. The point is that the original 
collection of open sets need not even be countably infinite. A good discussion of 
compactness in metric spaces can be found in [Mendelson 1963], Chapter V. 


4.10. Let X = (0, 1) U {2}. That is, X consists of an open interval in R, together 
with an “isolated” point. Show that the subsets (0, 1) and {2} of (X, Euclidean) are 
open. Show that (0, 1) is closed in X. Show that {2} is closed in X. Show that {2} is 
compact in X but (0, 1) is not compact in X. 


Definition 4.5 Let S Cc X be a subset of a metric space (X, d). A point x € X is 
a boundary point of S if for every number € > 0, B(x, €) contains a point in X\ S 
and a point in S. The set of all boundary points of S is called the boundary of S and 
is denoted OS. 


Definition 4.6 Let S Cc X be a subset of a metric space (X,d). A point x € S is 
called an interior point of S if there is a number € > 0 such that B(x, €) C S. The set 
of interior points of S is called the interior of S and is denoted S°. 


Examples & Exercises 


4.11. Let S be a subset of a metric space (X, d). Show that 0S = 0(X \ S). Deduce 
that OX = @. 


4.12. Show that the property of being a boundary of a set is invariant under metric 
equivalence. 


4.13. Let (X, d) be the real line with the Euclidean metric. Let S denote the set of 
all rational points in X (i.e., real numbers that can be written A where p and q are 
integers with g # 0). Show that 0S = X. 


4.14. Find the boundary of C viewed as a subset of (C, spherical metric). 
4.15. Let S be a closed subset of a metric space. Show that 0S Cc S. 
4.16. Let S be an open subset of a metric space. Show that dSN S = @. 


4.17. Let S be an open subset of a metric space. Show that S° = §. Conversely, 
show that if S° = S then S is open. 


4.18. Let S be a closed subset of a metric space. Show that S = S°U@S. 
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Figure 11.15. How well ~ 

can topological concepts Metric Space X, the world 

such as open, boundary, ‘QS The coastline is 

etc., be used to model Stes the boundary of 

land, sea, and coastlines? the set called LAND 
and the set called 
SEA 


oe The land is the interio 


of the island. 
The wet stuff is the interior of the sea. 


4.19. Show that the property of being the interior of a set is invariant under metric 
equivalence. 


4.20. Show that the boundary of a set S in a metric space always divides the space 
into two disjoint open sets whose union, with the boundary 05S, is the whole space. 
Illustrate this result in the following cases, in the metric space (R*, Euclidean): (a) 
S = {(x, y) € R?: x7 + y* <1}; (6) S=R?’. 


4.21. Show that the boundary of a set is closed. 
* 4.22. Let S be a subset of a compact metric space. Show that dS is compact. 


4.23. Figure II.15 shows how we think of boundaries and interiors. What features 
of the picture are misleading? 


4.24. To what extent does Mercator’s projection provide a metric equivalence to a 
Cartesian map of the world? 


4.25. Locate the boundary of the set of points marked in black in Figure II.16. 


4.26. Prove the assertion made in the caption to Figure II.17. 


5 Connected Sets, Disconnected Sets, and Pathwise-Connected Sets 


Definition 5.1 A metric space (X, d) is connected if the only two subsets of X 
that are simultaneously open and closed are X and %. A subset S C X is connected 
if the metric space (S,d) is connected. S is disconnected if it is not connected. § 
is totally disconnected provided that the only nonempty connected subsets of S are 
subsets consisting of single points. 


Definition 5.2 Let S C X be a subset of a metric space (X,d). Then S is 
pathwise-connected if, for each pair of points x and y in S, there is a continuous 
function f : [0,1] — S, from the metric space ({0, 1], Euclidean) into the metric 
space (S,d), such that f (0) =x and f(1) = y. Such a function f is called a path 
from x to y in S. S is pathwise-disconnected if it is not pathwise-connected. 
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Figure 11.16. Should 
the black part be called 
open and the white part 
closed? Locate the bound- 
ary of the set of points 
marked in black. 
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or wig A, 


Figure 11.17. The in- 
terior of the “land” set is 
an open set in the met- 
ric space (Y = A 
Euclidean). The smaller 
filled rectangle denotes a 
subset Z = @ of Y. The 
intersection of the inte- 
rior of the land with Z is 
an open set in the met- 
ric space (Z, Euclidean), 
despite the fact that it in- 
cludes some points of the 
“border” of @. 
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One can also define simply connected and multiply connected. Let S be pathwise- 
connected. A pair of points x, y € S is simply connected in S if, given any two paths 
fo and f; connecting x, y in S, we can continuously deform fo to f; without leaving 
the subset 5. What does this mean? 

Let there be the two points x, y € S and the two paths fo, fi connecting x, y in 
S. In other words, fo, f; are two continuous functions mapping the unit interval 
(0, 1] into S so that fo(0) = f,;(0) =x and fo(1) = fi(1) = y. By a continuous 
deformation of fo into f; within S we mean a function g continuously mapping the 
Cartesian product [0, 1] x [0, 1] into S, so that 


(a) g(s,0) = fo(s)(O<s <1) 
(b) gs, D=fi)Oss <)) 
(c) g(0,t) =x(0<t <1) 
(d) g,t)=yO<t <1) 


Thus, we say that two points x, y in § are simply connected in S if, given any two 
paths fo, f; going from x to y in S, there exists a function g as just described. This 
idea is illustrated in Figure IT.18. 

If x, y are not simply connected in S, then we say that x, y are multiply connected 
in S. S itself is called simply connected if every pair of points x, y in S is simply 
connected in S. Otherwise, S is called multiply connected. In the latter case we can 
imagine that S contains a “hole,” as illustrated in Figure II.19. 


Examples & Exercises 


* 5.1. Show that the properties of being (pathwise) connected, disconnected, sim- 
ply connected, and multiply connected are invariant under metric equivalence. 


5.2. Show that the metric space (@, Euclidean) is simply connected. 


Figure 11.18. A path fo fp(s)=9(s,0) 
which connects the points 
x and y is continuously 
deformed, while remain- 
ing “attached” to x and 
y, to become a second 


path Fi: 


X=g(0,t) > Y=q(1,t) 


f,(s)=9(s.1) 
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Figure 11.19. Ina mul- 
tiply connected space 
there exist paths that 
cannot be continuously 
deformed from one to an- 
other. There is some kind 
of “hole” in the space. 


Figure 11.20. Locate 
the largest connected 

subsets of this subset 

of R?. 


6 The Metric Space (7/(X), 4): The Space Where Fractals Live 


~~ the dot at the corner ~—ss-t-r-e-t-c-h to oO ~—]_:=SséW Figure 11.21. A Cauchy 
ae eae oe sequence being preserved 
by a metric equivalence 
and destroyed by a certain 
homeomorphism. 


(X,d) Cauchy Sequence 
destroyed ! 


(X,d) contains a 
Cauchy Sequence 
metric ake 
equivalence | X,. 2° 


(X, d) Cauchy Sequence 
survived ! 


How do we know that the set of real numbers {d(x, y) : y € B} contains a min- 
imum value, as claimed in the definition? This follows from the compactness and 
nonemptyness of the set B € {((X). Consider the function f : B — R defined by 


f(y) =d(x, y) for all y € B. 


From the definition of the metric it follows that f is continuous, viewed as a trans- 
formation from the metric space (B,d) to the metric space (R, Euclidean). Let 
P = inf{ f(y): y € B}, where “inf” is defined in exercise 6.19, and also in Defi- 
nition 6.2 in Chapter III. Since f(y) > 0 for all y € B, it follows that P is finite. We 
claim there is a point y € B such that d(x, y) = P. We can find an infinite sequence 
of points {y, :2 = 1,2,3,...} C B such that f(y,) — P < = for each positive in- 
teger n. Using the compactness of B, we find that {y,:n = 1,2,3...} has a limit 
} € B. Using the continuity of f we discover that f()) = P, which is what we 
needed to show. 

Color Plate 2 shows a picture of the metric space (@, Manhattan). It has been 
colored as follows. Let F denote a certain subset of m whose “geometry” is that of a 
piece of a fern. Then the color of each point a € mis fixed by the value of d(a, F). 


Definition 6.3 Let (X, d) be a complete metric space. Let A, B € H(X). Define 
d(A, B) = max{d(x, B):x € A}. 
d(A, B) is called the distance from the set A € H(X) to the set B € H(X). 


Just as above, using the compactness of A and B, we can prove that this definition 
is meaningful. In particular, there are points x € A and y € B such that d(A, B) = 
d(x, ¥). 

Examples & Exercises 
6.3. Show that B, C € H(X), with B C C, implies d(x, C) < d(x, B). 
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6.4. Calculate d(x, B) if (X,d) is the space (IR, Euclidean), x € R? is the point 
(1, 1), and B is aclosed disk of radius 4 centered at the point (5, 0). 


6.5. Same as 6.4 above, but use the Manhattan metric. 


Q. 


6.6. Calculate d(x, B) if (X, d) is (R, Euclidean), x = 4, an 


B= {x,=3+(-1)"— 


Sa mt, 2,3, .: 347 3}. 

6.7. Let A, B € H(X), where (X,d) is a metric space. Show that, in general, 
d(A, B) #d(B, A). Conclude that d does not provide a metric on H(X). It is not 
symmetrical: the distance from A to B need not equal the distance from B to A. 


6.8. Figure II.23 shows two subsets A and B of (ac R?, Euclidean). A is the white 
part and B is the black part. (a) Estimate the location of a pair of points, x € A and 
y € B, such that d(x, y) = d(A, B). (b) Estimate the location of a pair of points, 
x € Aand y € B, such that d(x, y) =d(B, A). 


6.9. Figure II.24 shows two fern-like subsets, A and B, of (R*, Manhattan). Locate 
points x € A and y € B such that: (a) d(x, y) = d(A, B); (b) d(x, y) =d(B, A). 


Figure 11.22. Points in 
the space H(R*) may be 
interpreted as black-and- 
white images. Unions of 
points yield new points. 
Be careful with intersec- 
tions, however. 


The whole smiley 
face is a point 

in H(X). 

Call it xe H(X). 


This thin torso 
iS a point 
in H(X). 
Call it y< H(X). 


This fellow is 
Xu y. 

He is a single 

point in H(X). 


hal 
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6.10. Find d(France, U.S.A.) and d(U.S.A., France) on (C, spherical metric). 
Which is larger? Also compare d(Georgia, U.S.A.) to d(U.S.A., Georgia). 


6.11. Let (X, d) be a complete metric space. Let A and B be points in H(X) such 
that A 4 B. Show that either d(A, B) £0 or d(B, A) # 0. Show that if A Cc B then 
d(A, B) =0. 


6.12. Let (X, d) be a complete metric space. Show that if A, B and C € 7#((X) then 
BcC=>d(A,C) <d(A, B). (Hint: Use 6.3.) 


6.13. Let (X, d) be acomplete metric space. Show that if A, B, and C € #{(X) then 


Figure 11.23. This frac- 
tal image contains a 
pair of disjoint subsets 
of @ C R?, “black” and 
“white.” Let A denote 
the closure of the set in 
black and let B denote 
the closure of its comple- 
ment. Find a pair of points 
x €A and y € B, such 
that d(x, y) = d(A, B). 
Find a pair of points 

x € A and y € B such 
that d(x, y) = d(B, A). 
Why do we “close” the 
sets before we begin? 
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Figure 11.24. Finda 
pair of points x and y, 
one in the dark fern and 
one in the pale fern, such a. 
that the Hausdorff distance ‘th a : Hedi ae Bue x “<<. 

between the two fern thy aan , i Hi nt, ae: R... 
images is the same as 7 " ing Nh it ) ia, : ° 

the distance between the Li in I mh ‘ale she 
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d(AUB,C)=d(A,C) Vd(B,C). 
We use the notation x V y to mean the maximum of the two real numbers x and y. 


Proof d(A U B,C) = max{d(x,C): x € AU B} = max{d(x,C):x € A} V 
max{d(x, C):x € B}. 


6.14. Let A, B, and C belong to H(X), where (X, d) is a metric space. Show that 
d(A, B) <d(A,C)+d(C, B). 

Also determine whether or not the inequality 
d(A, B) < d(C, A) + d(C, B) 

is true in general. 


Definition 6.4 Let (X, d) be a complete metric space. Then the eins dis- 
tance between points A and B in H(X) is defined by 


h(A, B) =d(A, B) Vv d(B, A). 


Examples & Exercises 
6.15. Show that h is a metric on the space H(X). 


7 The Completeness of the Space of Fractals 


Proof Let A, B, C € H(X). Clearly h(A, A) = d(A, A) Vd(A, A) =d(A, A) = 
max{d(x, A): x € A}=0. A(A, B) =d(a, b) for some a € A and be B, using the 
compactness of A and B. Hence 0 < h(A, B) < oo. If A B we can assume there 
is anaéA so that a ¢ B. It follows that h(A, B) > d(A, B) > 0. To show that 
h(A, B) < h(A, C)+h(C, B) we first show that d(A, B) < d(A, C)+ d(C, B). We 
have for anyaeéA 

d(a, B) = min{d(a, b): be B} 
< min{d(a,c)+d(c,b):be BWceEC, 
= d(a,c) + min{d(c, b):b € B}Wc € C, so 
d(a, B) < min{d(a, c):c € C} + max{min{d(c, b):b € B}:c EC}, 
=d(a,C)+d(C, B), so 
d(A, B) =d(A,C)+d(C, B). 
Similarly, 
d(B, A) <d(B,C)+ d(C, A), whence 
h(A, B) =d(A, B) Vv d(B, A) < d(B,C) V d(C, B) + d(A, C) v d(C, A) 
=h(B,C)+h(A, C), as desired. 


6.16. Show that h(A U B, CU D) < h(A, C) Vv A(B, D), for all A, B, C, and De 
H(X). 


6.17. Show that h(A, B) =d(a, b) for some a € A and be B. 


6.18. The same situation as in 6.9, but this time locate a pair of points x € A and 
y € B such that d(x, )) =h(A, B), the Hausdorff distance from A to B. 


6.19. Let SC R, with S 4 8. The supremum of S is denoted by sup S. The infinum 
of S is denoted by inf S. If there is no real number greater than all of the numbers 
in S, then sup S = +00; otherwise sup S = min{x € R: x => sVs € S}. If there is 
no real number less than all of the numbers in S, then inf S = —oo; otherwise 
inf S = max{x € R: x <sVs € S}. Show that sup S and inf S are well defined. Show 
that if S is compact then sup S = max S and inf S = min S. Further exercises on sup 
and inf are given following Definition 6.2 in Chapter III. 

By replacing max by sup and min by inf, respectively, throughout the definition 
of the Hausdorff metric, define a “distance” between arbitrary pairs of subsets of a 
metric space. Give several reasons why this “distance” is not usually a metric. 
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7 The Completeness of the Space of Fractals 


We refer to (H(X), 4) as the “space of fractals.” It is too soon to be formal about 
the exact meaning of a “fractal.” At the present stage of development of science and 
mathematics, the idea of a fractal is most useful as a broad concept. Fractals are not 
defined by a short legalistic statement, but by the many pictures and contexts that 
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refer to them. For the first eight chapters of this book, any subset of (H(X), h) is a 
fractal. However, as with the concept of a “space,” more meaning is suggested than 
is formalized. 

In this section our principal goal is to establish that the space of fractals (H(X), h) 
is a complete metric space. We also want to characterize convergent sequences in 
H(X). To achieve these goals using only the tools introduced so far is quite difficult. 


Indeed, at this juncture, we want to introduce another notion; namely, the idea of 


extending certain Cauchy subsequences. 


Definition 7.1 Let S Cc X and let >0.ThenS+T ={y€X:d(x, y) <I for 
some x € S}. S+T is sometimes called, for example, in the theory of set morphol- 
ogy, the dilation of S by a ball of radius I’. 


Lemma 7.1 Let A and B belong to H(X) where (X, d) is a metric space. Let 
€ > 0. Then 


h(A,B)<eSACBt+e and BCA+eE. 


Proof Begin by showing that d(A, B) <e © ACB+e. Suppose d(A, B) 
<e. Then max{d(a, B):a € A} <e€ implies d(a, B) <e for all a ¢ A. Hence 
for each a € A we have ae B+e. Hence “AC B+.” Suppose “A C B+ €.” 
Consider d(A, B) = max{d(a, B):a € A}. Let ae A. Since A C B + €, there is 
a,b €B so that d(a, b) < «€ for all a € A. Hence d(a, B) < €. This is true for each 
a €é A.So“d(A, B) < €.” This completes the proof. 

Let {A, :n =1,2,..., 00} be a Cauchy sequence of sets in (H((X), h). That is, 
given € > 0, there is N so that n, m > N implies 


An +€ D Am and Am +€ D An, 


i.e., h(An, Am) < €. We are concerned with Cauchy sequences {x,}°° , in X with the 
property that x, € A, for each n. In particular, we need the following property that 
allows the extension of a Cauchy subsequence {xn,; € An,};2,, with the property that 
Xn, € An, for each j, to a Cauchy sequence {x, € A,}?°.,. 


Lemma 7.2 The Extension Lemma. Let (X, d) be a metric space. Let {A,, : 


n=1,2,...,00} be a Cauchy sequence of points in (H(X), h). Let {nn} be an 
infinite sequence of integers | 


O<n, <n <7n3<:::. 
Suppose that we have a Cauchy sequence {Xn, € An, : j = 1,2,3,...} in (X,d). 


Then there is a Cauchy sequence {x, € A,:n =1,2,...} such that xp, =Xn,, for 
au i a t.2. 5, .. .. 


Proof We give the construction of the sequence {x, € A,:n =1,2,...}. For 
each n € {1,2,...,,}, choose x, € {x € An : d(x, Xn,) = d(Xn,, An)}. That is, x, is 
the closest point (or one of the closest points) in A, to x,,. The existence of such a 
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Figure 11.25. The be- 
ginning of a Cauchy 
sequence {A,} of sets 

in H(R*) is shown. A 
Cauchy subsequence of 
points {x,,} belonging to 
a subsequence of the sets 
is also indicated. Make a 
photocopy of the figure, 
and mark on it the exten- 
sion of the subsequence of 
points to the visible sets in 
{Aj}. 


closest point is ensured by the compactness of A,,. Similarly, for each j € {2,3,...} 
and eachn € {n; + 1,..., mj41}, choose xX, € {x € Ay: d(x, Xn,) =d(Xn,, An)}. 

Now we show that {x,} has the desired properties, that it is indeed an extension 
of {x,,} to {An}. Clearly x,,; =2x,, and x, € A,, by construction. To show that it 
is a Cauchy sequence let € > 0 be given. There is N, so that ny,n; > N; implies 
A(Xn,» Xn;) < €/3. There is Nz so that m,n > N> implies 

d(Am, An) < €/3. 
Let N = max{Nj, N>} and note that, for m,n > N, 
d(Xm, Xn) = d (Xm, Xn;) + d(Xn,; Xn,) - A(Xn,; ands 

where m € {n;-1 +1,n;-1+2,...,nj} andn € {my_; +1, ny-1 +2, ..., my}. Since 
h(Am, An;) < €/3 there exists y € Ay ({Xn,} + €/3) so that d(%m, Xn,) < €/3. Sim- 
ilarly d(%n,, Xn) < €/3. Hence d(x,, X,) < € for all m,n > N. This completes the 
proof. 


Examples & Exercises 


7.1. A Cauchy sequence {A,} of sets in (H(R*), h) is sketched in Figure II.25. The 
underlying metric space is (R?, Euclidean). A Cauchy subsequence {x,, ; © An,} is 
also shown. Sketch, in the same figure, the extension {x,,} of this subsequence to 
{An}. 
7.2. Repeat 7.1 but this time with reference to Figure II.26. 

The central result toward which we have been driving is this: 


Theorem 7.1 The Completeness of the Space of Fractals. Let (X, d) 
be a complete metric space. Then (H(X), h) is a complete metric space. Moreover, 
if {A, € H(X)}" | is a Cauchy sequence, then 


A= lim A, € H(X) 
n->0o 
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can be characterized as follows: 


A= {x €X: there is a Cauchy sequence {x, € A,,} that converges to x}. 


Proof Let {A,} be a Cauchy sequence in 7#{(X) and let A be defined as in the 
statement of the theorem. We break the proof up into the following parts: 

(a) A#Q; 

(b) A is closed and hence complete since X is complete; 

(c) fore > O there is N such that forn > N, A CA, + €; 

(d) A is totally bounded and thus by (b) is compact; 

(e) lim A, = A. 

Proof of (a): We shall prove this part by proving the existence of a Cauchy se- 
quence {a; € A;} in X. Toward this end find a sequence of positive integers N, < 
N> < N3 <---< N, <--- such that 


1 
h(Am, An) < 5 form,n > Nj. 


Choose xy, € An,. Then since h(Ay,, An,) < $ we can find an xy, € Ay, such 
that d(xy,,Xn,) < 5. Assume that we have selected a finite sequence xy, € Ay,;i = 
1,2,...,k for which d(xy,_,,xn,) < ser. Then since h(Ay,, An,,,) < sr, and xy, € 
An,, we can find xy,,, € An,,, Such that d(xy,, X,4,) xr For example let xw,,, be 
the point in Ay,,, that is closest to x,. By induction we can find an infinite sequence 
{xy, € Av,} such that d(xy,, Xn,,,) < 37. To see that {x,,} is a Cauchy sequence in X, 


let € > 0 and choose N, such that )7;" v, x <e.Then form >n > WN, we have 


d(xy,. ty) = diy... En, OG, ci 4Nn 2) + *** + EOn,_,. £N,) 
— 1 
ey 5 <e 
i=N; 
By the Extension Lemma, there exists a convergent subsequence {a; € A;} for which 
an, = Xy,- Then lim a; exists and by definition is in A. Thus A # 9. 

Proof of (b): To show that A is closed, suppose {a; € A;} is a sequence that con- 
verges to a point a. We will show that a € A, hence making A closed. For each pos- 
itive integer i, there exists a sequence {x;,, € A,} such that lim,_, 40 X;,, = a;. There 
exists an increasing sequence of positive numbers {N;}°°, such that d(ay,,a) < t 


Figure 11.26. The same 
problem as for Fig- 
ure II.25. The sets {A,} 


look very different here. 
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Furthermore, there is a sequence of integers {m;} such that d(xy,.m,,4n,) < *, Thus 
d(XN,,m;,@) < 2/t. If we let ym; = XN,,m, We see that ym, € Am, and limi > Coyp, = 
a. By the Extension Lemma, {y,,,} can be extended to a convergent sequence {z; € 
A;}, and so a € A. Thus we have shown A is closed. 

Proof of (c): Let € > 0. There exists an N such that form,n > N,h(A,,, A,) < €. 
Now let n > N. Then for m > n, A, C A, + €. We need to show that A C A, + €. 
To do this, let a € A. There is a sequence {a; € A;} that converges to a. We may 
assume WN is also large enough so that for m > N, d(am, a) < €. Then a», € A, + € 
since A, C A, + €. Since A, is compact, it can be shown that A, + € is closed. Then 
SINCE Am € A, + € for all m > N,a must also be in A, + €. This completes the proof 
that A C A, + € for n large enough. 

Proof of (d): Suppose A were not totally bounded. Then for some € > 0 there 
would not exist a finite €-net. We could then find a sequence {x;}?°, in A such that 
d(x;,x;) => € for i # j. We shall show that this gives a contradiction. By (c) there 
exists an n large enough so that A C A, + §. For each x;, there is a corresponding 
y; € A, for which d(x;, y;) < $. Since A, is compact, some subsequence {y,,} of 
{y;} converges. Then we can find points in the sequence {y,,} as close together as we 
wish. In particular we can find two points y,, and y,, such that d(yn,, Yn,) < 5. But 
then 


€ € € 
A(Xn;s Xn,;) = A(Yn;s Vn) + d(yn;s Yn;) = A(Yn;1 Xn;) « 4 + e + _ 


and we have a contradiction to the way {x,,} was chosen. Thus A is totally bounded 
and by part (b) compact. 

Proof of (e): From part (d), A € H(X). Hence by part (c) and Lemma 7.1 the 
proof that lim A; = A will be complete if we show that for e€ > 0, there exists an 
N such that for n > N, A, C A +e. To show this let « > 0 and find N such that 
for m,n > N,h(Am, An) < = Then for m,n >N,AmCA,+ = Let n > N. We 
will show that A, C A +e. Let y € A,. There exists an increasing sequence {N;} 
of integers such that n < N; < No < N3<---<N,<-:--andform,n>WN;,AnC 
A, + sn- Note that A, C Ay, + 5. Since y € Ay, there is an xy, € Ay, such that 
d(y, Xy,) < 5. Since xy, € Ay,, there is a point xy, € Ay, such that d(xy,, xy.) < 5. 
In a similar manner we can use induction to find a sequence xy,, Xn, XN;;---, Such 
that xy, € Ay, and d(xy,, xn,,,) < 377- Using the triangle inequality a number of 
times we can show that 


d(y,xn,) <5 forall j 


and also that {xy,} is a Cauchy sequence. From the way n was chosen, each Ay, C 
A, + §. {xy,} converges to a point x and since A, + 5 is closed, x € A, + § also. 
Moreover, d(y, xy,) < € implies that d(y, x) <€. We have thus shown that A, C 
A +e for n> WN. This completes the proof that lim A, = A and consequently that 
(H(X), h) is a complete metric space. 
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Examples & Exercises 


7.3. A tree waves in the wind. A special camera photographs the tree at times ¢, = 
(i — +) sec, n = 1,2,3,.... Show that with reasonable assumptions the sequence 
of pictures thus obtained form a Cauchy sequence {A,}°, in A(R’). What does 
A = limy-co An look like? 


7.4. The Sierpinski triangle A is a compact subset of (IR*, Euclidean). Hence 
(A, Euclidean) is a compact metric space. Given an example of an infinite set in 
(H(A, h), demonstrate a Cauchy sequence {A, € 7((A)} that is contained in your 
set and describe its limit. 


7.5. Figure II.27 shows a convergent sequence of sets in 7{(@) converging to a fern. 
Pick a point in A. Find a Cauchy sequence {x, € A,} that converges to it. 


* 7.6. Let (X,d) be a compact metric space. Show that (H(X), 4) is a compact 
metric space, where h is the Hausdorff metric on the space 7{(X). 


It’s a good idea to get familiar with (X), by trying to see what properties of the 
space X are also true of 7{(X). We have seen that completeness is one of these, and 
exercise 7.6 shows that compactness is as well. So are some forms of connectedness, 
and by way of example we prove this for path-connectedness of 71(R) as follows: 


Theorem 7.2 The function f :R — H given by f (x) = {x} is continuous. 


Proof Let {x,,} be a sequence in R that converges to a point x. Then given € > 
O there is an N such that for n > N, d(x,, x) < €. By definition, h({x,}, {x}) = 
d(Xn,X), Since there is only one element in each set. Hence {{x,}} converges to {x} 
in H(R). So the function f(x) is continuous. Thus the image of R in 7{(R) is path- 
connected. Notice that a function like this guarantees that there will be a copy of any 
space in its associated space of non-empty compact sets. 


Theorem 7.3 The functions f,: [0,1] > H given by f,(a) =[x,x +a],0< 
a <1 are continuous, thereby showing that there is a path in ‘H from an interval to 
one of its endpoints. 

Proof As before, let {a,,} be a convergent sequence in [0, 1] with limit a. Suppose 
d(a,,a) < €. Then the distance h([x, x + ay], [x, x + a]) is given by 
h([x, Xx = = anl, [xx +a}) = d([x, x + An], LX Xx + a}) V d([x, x ~ a}, [x, x + Gn]) 

=d(a,a,) <€. 
Hence each of the functions f, is continuous. 

Theorem 7.4 If A is a compact subset of R then the function f,:(0,b] > H 
given by fa(a) = Ux, x +a] such that x € A is continuous. 

Proof By part (b), the function f,4 is continuous at each point in A, when b = 1. 
Hence the statement is true for b = 1. To get the whole statement, we note that if 
g:[0,b] — [0, 1] is given by g(x) = (1/b)x, g is continuous. Hence f, can be 
written as the composition of two continuous functions and is continuous. 
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Figure 11.27. A Cauchy 
sequence of sets {A,} in 
the space 71(R7) converg- 
ing to a fern-like set. 
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Theorem 7.5 /f A is a compact subset of R then the set \[x, x + b] such that 
x € A is an interval for b large enough. 


Proof A is compact, hence it is bounded. If we take b to be the length of any 
interval [c, d] such that A C [c, d], then the set [x, x + b] with x the least element 
of A will overlap the set [y, y + b] where y is the greatest element of A. Thus all 
the sets of this form will overlap, and their union forms one interval, which is path- 
connected. 


Theorem 7.6 /f A and B are compact subsets of R then there is a path in H 
connecting them. 


Proof For any space X and three points a, b, c € X, if there is a path from a to b 
and one from b to c then there is one from a to c. We construct a path between A and 
B in H(R) by taking first a path from A to the interval formed in part (d), namely 
choose b as in (d) and construct the path 


f :[0, 1] > [0, b] > (fa): 05x <5}, 


which is continuous by part (c). Thus there is a path between every point in 7((R) 
and a point that is an interval. Similarly, there is a path from some interval to every 
point in H(R). By part (b) there is a path from every interval to one of its endpoints. 
By part (a) there is a path (the image of R) between any two of these endpoints. 
Therefore a path from A to B may be constructed by taking A to an interval, taking 
the interval to an endpoint, moving to the a new point of R, making a path to 
an interval, and taking the path from the interval to B. Hence H(R) is pathwise- 
connected. It is true, though somewhat more complicated, that if X is connected then 
so is 71(X). 


8 Additional Theorems about Metric Spaces 


We state here a number of theorems that we shall use later on. Full proofs are not 
provided. They can be found in most introductory topology texts. We particularly 
recommend [Kasriel 1971] and [Mendelson 1963]. These theorems may be treated 
as exercises in metric space theory. 


Theorem 8.1 Let (X,d) be a metric space. Let {x,} be a Cauchy sequence 
convergent to x € X (or equivalently let {xn} be a sequence and x be a point, such 
that limy—+oo A(X, X,) = 0). Let f : X — X be continuous. Then 


Lim f Gn) = f(@). 


Proof See your first calculus book. 


Theorem 8.2 Let (X;, d,) and (X2, do) be metric spaces. Let f : X, — Xz be 
continuous. Let E C X, be compact. Then f : E — Xz is uniformly continuous: that 
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is, given € > O there is a number 5 > 0 so that 


d2(f (x), f(y)) < € whenever d)(x, y) <6 forall x, ye E. 


Proof Use the fact that any open cover of E contains a finite subcover. 


Theorem 8.3 Let (X;,d;) be metric spaces for i = 1,2,3. Let f :X, x X.> 
X3 have the following property. For each € > O there exists 5 > 0 such that 
(1) di(x1, v1) < 8 => 43(f (x1, x2), f (M1, y2)) <€, 
Wx1, y1 © X 
Vx € X> 
and 
(ii) da(x2, 2) < 6 > a3(f (M1, x2), fO1, 2) <, 
Vy od Xj 
Vx2, y2 © X2 
Then f is continuous on the metric space (X = X; x X2, d), where d((x), x2), (y1, 
y2)) = max{d; (x1, y1), d2(x2, y2)}. 


Proof Use 
d(f (x1, X2), FO, y2)) < d(f x1, x2), f 1, x2)) + d(f (1, x2), fF 1, y2)), 
but check first that d is a metric. 


Theorem 8.4 Let (X;, d;) be metric spaces for i = 1,2 and let the metric space 
(X, d) be defined as in Theorem 8.3. If K, C X, and Kz C X» are compact, then 
K, x Kz C X is compact. 


Proof Deal with the component in K;, first. 


Theorem 8.5 Let (X;, d;) be compact metric spaces for i = 1,2. Let f :X; —> 
X2 be continuous, one-to-one, and onto. Then f is a homeomorphism. 
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Transformations on the Real Line 


Fractal geometry studies “complicated” subsets of geometrically “simple” spaces 
such as R2, C, R, and C. In deterministic fractal geometry the focus is on those 
subsets of a space that are generated by, or possess invariance properties under, 
simple geometrical transformations of the space into itself. A simple geometrical 
transformation is one that is easily conveyed or explained to someone else. Usually 
it can be completely specified by a small set of parameters. Examples include affine 
transformations in R*, which are expressed using 2 x 2 matrices and 2-vectors, and 
rational transformations on the Riemann Sphere, which require the specification of 
the coefficients in a pair of polynomials. 


Definition 1.1 Let (X, d) be a metric space. A transformation on X is a function 
f :X — X, which assigns exactly one point f (x) € X to each point x € X. IfS CX 
then f(S) ={f(x):x € S}. f is one-to-one if x, y € X with f(x) = f(y) implies 
x=y. f is onto if f(X) =X. f is called invertible if it is one-to-one and onto: in 


1 Transformations on the Real Line 


this case it is possible to define a transformation f~'! :X — X, called the inverse of 
f, by f~'(y) =x, where x € X is the unique point such that y = f (x). 


Definition 1.2 Let f: X — X be a transformation ona metric space. The forward 
iterates of f are transformations f°" :X — X defined by f(x) =x, f(x) = 
f(x), feet (x) = fo f(x) =f f™)) forn=0,1,2,.... If f is invertible 
then the backward iterates of f are transformations f°~™ (x) : X — X defined by 
PVs fF = eS... 


In order to work in fractal geometry one needs to be familiar with the basic fam- 
ilies of transformations in R, R*, C, and C. One needs to know well the relation- 
ship between “formulas” for transformations and the geometric changes, stretchings, 
twistings, foldings, and skewings of the underlying fabric, the metric space upon 
which they act. It is more important to understand what the transformations do to 
sets than how they act on individual points. So, for example, it is more useful to 
know how an affine transformation in R? acts on a straight line, a circle, or a trian- 
gle, than to know to where it takes the origin. 


Examples & Exercises 
1.1. Let f: X — X be an invertible transformation. Show that 


om 5 fon — folm-+tn) for all integers m and n. 
g 


1.2. A transformation f : R — R is defined by f(x) = 2x for all x € R. Is f in- 
vertible? Find a formula for f°” (x) that applies for all integers n. 


1.3. A transformation f : [0, 1] — [0, 1] is defined by f(x) = 5x. Is this transfor- 
mation one-to-one? Onto? Invertible? 


1.4. The mapping f : [0, 1] — [0, 1] is defined by f(x) = 4x - (1 — x). Is this trans- 
formation one-to-one? Onto? Is it invertible? 


1.5. Let C denote the Classical Cantor Set. This subset of the metric space [0, 1] 
is obtained by successive deletion of middle-third open subintervals as follows. We 
construct a nested sequence of closed intervals 


le 01 iD Oa I i i ca Os 


where 
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Figure 111.28. Con- 
struction of the Classical 
Cantor Set C. 
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I, = I; take away the middle open third of each interval in J;, 


I, = Iy_, take away the middle open third of each interval in Iy_,. 


This construction is illustrated in Figure III.28. We define 
Cael Via ly: 


C contains the point x = 0, so it is nonempty. In fact C is a perfect set that contains 
uncountably many points, as discussed in Chapter IV. C is an official fractal and we 
will often refer to it. 

We are now able to work in the metric space (C, Euclidean). A transformation 
f :C — Cis defined by f(x) = tx. Show that this transformation is one-to-one but 
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Subsets of | are transported Figure 111.29. The ac- 
a real interval tion of the affine transfor- 
_ length L f : length [aj L mation f : R — R defined 


by f(x) =ax +b. 


! (1 
rotate 180° about Rai, " 
if a is less than zero. 


0 ig SOS pre 1 Figure l11.30. This fig- 


ure suggests a sequence of 
intervals {J,,}°°). Find an 
affine transformation f : 
R — R so that fC) = 
1, t0r 07 T, 23, 
Use a straight-edge and 
dividers to help you. 


, ly Iz |g 


not onto. Also, find another affine transformation (see example 1.7, which maps C 
one-to-one into C). 


1.6. f : R* — R? is defined by f(x), x2) = (2x1, x3 + x1) for all (x1, x2) € R2. 
Show that f is not invertible. Give a formula for f°?(x). 


1.7. Affine transformations in R! are transformations of the form f(x) =a-x +b, 
where a and D are real constants. Given the interval J = [0, 1], f (7) is a new interval 
of length |a|, and f rescales by a. The left endpoint 0 of the interval is moved to b, 
and f (J) lies to the left or right of b according to whether a is positive or negative, 
respectively (see Figure IJI.29). 

We think of the action of an affine transformation on all of R as follows: the whole 
line is stretched away from the origin if |a| > 1, or contracted toward it if |a| < 1: 
flipped through 180° about O if a < 0; and then translated (shifted as a whole) by 
an amount b (shift to the left if b < 0, and to the right if b > 0). 


1.8. Describe the set of affine transformations that takes the real interval X = [1, 2] 
into itself. Show that if f and g are two such transformations then f o g and go f 
are also affine transformations on [1, 2]. Under what conditions does f o g(X) Ugo 
f(%) =X? 


1.9. A sequence of intervals {J,}°°, is indicated in Figure III.30. Find an affine 
transformation f : R > R so that f°’(o) = J, forn = 0, 1,2, 3,.... Use a straight- 
edge and dividers to help you. Also show that {/,}°2, is a Cauchy sequence in 


(7{(R), h), where h is the Hausdorff distance on H(R) induced by the Euclidean 
metric on R. Evaluate J = lim,-,. /,. 
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: es eee 
Figure Ill.31. Picture 


of a convergent geometric lo a ly | 3 Ig |e Ig 
series in R! (see exer- 
cise 1.10). 


1.10. Consider the geometric series °° )b-a"=b+a-b+a’b+a°b+a*b+ 
..»>0,0 <b <1. This is associated with a sequence of intervals Io = [0, 5], J, = 
f"(Io), where f(x) =ax + b,n=1, 2, 3,..., as illustrated in Figure III.31. 

Let J = U%o/, and let / denote the total length of J. Show that f(/) = / \ Jo, and 
hence deduce that al =/ — b so that / = b/(1 — a). Deduce at once that 


(o,@) 

)\b-a" =b/(1 — a). 

n=0 
Thus we see from a geometrical point of view a well-known result about geometric 
series. Make a similar geometrical argument to cover the case —1 <a < 0. 


Definition 1.3 A transformation f :R — R of the form 
f() =a Faux + anx” + a3x? +--+ a,x", 


where the coefficients a; (i =0, 1,2,..., N) are real numbers, a, #0, and N isa 
nonnegative integer, is called a polynomial transformation. N is called the degree of 
the transformation. 


Examples & Exercises 


1.11. Show that if f : R > R and g: R > R are polynomial transformations, then 
sois f og. If f is of degree N, calculate the degee of f°"(x) form = 1, 2, 350° 


1.12. Show that for n > 1 a polynomial transformation f: R — R of degree n is not 
generally invertible. 


1.13. Show that far enough out (i.e., for large enough |x|), a polynomial transfor- 
mation f : R — R always stretches intervals. That is, view f as a transformation 
from (R, Euclidean) into itself. Show that if J is an interval of the form / = {x: 
|x — a| < b} for fixed a, b € R, then for any number M > 0 there is a number f > 0 
such that if b > B, then the ratio (length of f (/))/(ength of J) is larger than M. This 
idea is illustrated in Figure III.32. 


1.14. A polynomial transformation f : R > R of degree n can produce at most 
(n — 1) folds. For example f(x) = x? — 3x + 1 behaves as shown in Figure III.33. 


1.15. Find a family of polynomial transformations of degree 2 which map the inter- 
val [0, 2] into itself, such that, with one exception, if y € f([0, 2]) then there exist 
two distinct points x; and x2 in [0, 2] with f(x1) = f(x2) = y. 


1.16. Show that the one-parameter family of polynomial transformations f, : 


1 Transformations on the Real Line 


Figure 11.32. A poly- 
R o nomial transformation 
See ee eee ee nee ee ees ee ee meee er f :R— R of degree > 1 


stretches R more and more 
the farther out one goes. 
R 


X-axis at large positive x 


x-axis at large positive x 


[O, 2] — [0, 2], where 
fix) =A-x-(2—x), 
and the parameter 4 belongs to [0, 1], indeed takes the interval [0, 2] into itself. 


Locate the value of x at which the fold occurs. Sketch the behavior of the family, 
in the spirit of Figure III.33. 3 


1.17. Let f : R > R be a polynomial transformation of degree n. Show that values 
of x that are transformed into fold points are solutions of 


ie =0,xeER. 

dx 

Solutions of this equation are called (real) critical points of the function f. If c isa 
critical point then f(c) is a critical value. Show that a critical value need not be a 
fold point. 

1.18. Find a polynomial transformation such that Figure III.34 is true. 

1.19. Recall that a polynomial transformation of an interval f : 7 C R —> J is nor- 
mally represented as in Figure III.35. This will be useful when we study iterates 
f°" (x)p2,. However, the folding point of view helps us to understand the idea of 
the deformation of space. 


1.20. Polynomial transformations can be lifted to act on subsets of R* in a simple 


Figure II1.33. The 
polynomial transformation 
f(x) =x? — 3x +1. 


triply folded region 
fold point fold point 


4] 
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polynomial transformation 
f :R—R, so that this 
figure correctly represents 
the way it folds on the real 
line. 


Figure IlI.34. Find a | eas Sea Se ES Se 
| 


fold point fold point fold point 
R 2 ; » | . : 
2 ae ee ee 


Figure I11.35. The 
usual way of picturing a edtical 
polynomial transforma- value 
tion. 


way: we can define, for example, F(x) = (fi(1), f2(%2)), where f; and f2 are 
polynomial transformations in R, so that F: R? — R?. Desired foldings in two 
orthogonal directions can be produced; or shrinking in one direction and folding in 
another. Show that the transformation F(x, x2) = (3x? - Oy? + 5,, x) acts on 


5 5 
the triangular set S in Figure III.36 as shown. 


The real line can be extended to a space which is topologically a circle by includ- 
ing the point at infinity. One way to do this is to think of R as a subset of C, and 


Figure 11.36. A poly- 
nomial transformation 
acting on a set S in the 
plane. 


fold lines 
double cover 
triple cover 


2 Affine Transformations in the Euclidean Plane 


N Figure 111.37. RU {00} 
becomes a circle on a 
sphere. 

0 


then include the North Pole on C. We define this space to be R=RU {oo} and will 
usually give it the spherical metric. 
Definition 1.4 A transformation f : RoR defined in the form 
ax +b 
TO os +d’ 
is called a linear fractional transformation or a Mobius transformation. Jf c 4 0, then 
f (—d/c) = 00, and f (00) =a/c. Ifc =0, then f (0a) = on. 


a,b,c,d€R,ad £ be, 


Examples & Exercises 

1.21. Show that a Mobius transformation is invertible. 

1.22. Show that if f; and f) are both Mobius transformations then so is f; 0 fo. 
1.23. What does f(x) = 1/x do to R on the sphere? 


1.24. Show that the set of MObius transformations f such that f (00) = 00 is the 
set of affine transformations. 


1.25. Find a Mobius transformation f : R — R so that f(Q)) =2"F 2) = 
f (0) = oo. Evaluate f (0c). 


1.26. Figure III.38 shows a Sierpinski triangle before and after the polynomial 
transformation x +> ax(x — b) has been applied to the x-axis. Evaluate the real con- 
stants a and b. Notice how well fractals can be used to illustrate how a transforma- 
tion acts. 
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Definition 2.1 A transformation w : R* > R? of the form 
w (x1, X2) = (ax; + bx2 + e, cx; + dx2 + f), 


where a,b,c,d,e, and f are real numbers, is called a (two-dimensional) affine 
transformation. 
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We will often use the following equivalent notations 
oes xiy {a 2B x] oS 
win-o(2)=(2 3) () (Ga 
Here A = e 4 is a two-dimensional, 2 x 2 real matrix and ¢ is the column 


vector S which we do not distinguish from the coordinate pair (e, f) € R’. 


Such transformations have important geometrical and algebraic properties. From this 
point on, we shall assume that the reader is familiar with matrix multiplication. 


The matrix A can always be written in the form 


a b\ _ /(r,cos@ —r7 sin 0, 
ee = r;, sin 6; rz COS A> 


BEFORE 
TRANSFORMATION 


AFTER (X,Y)—(aX:(X-b),Y) 
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Figure 111.38. A Sierpinski triangle before and after the polynomial transformation 
x +> ax(x — b) is applied to the x-axis. Evaluate the real constants a and b. 
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Figure 111.39. An affine 
transformation takes par- 
allelograms into parallelo- 
grams. 


length r, 


- i 
rs psecawes Ser Sa | 


i 


Figure 11.40. A linear 
transformation can turn 
pictures over. 


A( Xi - . COS 6; -F2 sin oA Xi 
x; r sin eo: fr. COS e /\ xX 


O 


where (7;, 6,) are the polar coordinates of the point (a, c) and (72, (Q2 + 2/2)) are 
the polar coordinates of the point (b, d). The linear transformation 


deals) 

X2 x2 

in R* maps any parallelogram with a vertex at the origin to another parallelogram 

with a vertex at the origin, as illustrated in Figure III.39. Notice that the parallelo- 

gram may be “turned over” by the transformation, as illustrated in Figure III.40. 
The general affine transformation w(x) = Ax +t in R? consists of a linear trans- 


formation, A which deforms space relative to the origin, as described above, fol- 
lowed by a translation or shift specified by the vector t (see Figure III.41). 
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Figure Ill.41. An affine 
transformation consists 

of a linear transformation 
followed by a translation. 


FIRST MAKE A LINEAR 
TRANSFORMATION 


(e,f) 


THEN 


TRANSLATE by t 


Figure 111.42. Two 

ivy leaves lying on the 
Euclidean Plane determine 
an affine transformation. 


How can one find an affine transformation that approximately transforms one 
given set into another given set in R?? Let’s show how to find the affine transfor- 
mation that almost takes the big leaf to the little leaf in Figure III.42. This figure 
actually shows a photocopy of two real ivy leaves. We wish to find the numbers 
a,b,c,d,e, and f defined above, so that 


w(BIG LEAF) approximately equals LITTLE LEAF. 


2 Affine Transformations in the Euclidean Plane 53 


Begin by introducing x and y coordinate axes, as already shown in Figure III.42. 
Mark three points on the big leaf (we’ve chosen the leaf tip, a side spike, and the 
point where the stem joins the leaf) and determine their coordinates (x,, x2), (y1, y2), 
and (z;, Z2). Mark the corresponding points on the little leaf, assuming that a cater- 
pillar hasn’t eaten them, and determine their coordinates; say (X, X2), (V1, ¥2), and 
(Z1, Z2), respectively. 

Then a, b, and e are obtained by solving the three linear equations 


xja+xb+e=X, 

yat+ yb+e=y, 

zatzwb+e=2; 
while c, d, and f satisfy 

xyC + xd + f =X, 

yic + yod + f = ya, 

zic + Zod + f = Zp. 


Examples & Exercises 


2.1. Find an affine transformation in R? that takes the triangle with vertices at 
(0, 0), (0, 1), (1, 0) to the triangle with vertices at (4,5), (—1, 2), and (3, 0). Show 
what this transformation does to a circle inscribed in the first triangle. 


2.2. Show that a necessary and sufficient condition for the affine transformation 


i a ()+(4)aare: 


to be invertible is det A + 0, where det A = (ad — bc) is the determinant of the 2 x 2 
matrix A. 


2.3. Show that if f; : R? > R* and f): R* > R? are both affine transformations, 
then so is 

fa= fico fa. 
If fi(x) = Ajx + ¢;,i = 1, 2,3, where A; is a2 x 2 real matrix, express A3 in terms 


of Aj and A2. 


2.4. Let A and B be 2 x 2 matrices, with determinants det A and det B, respec- 
tively. Show that the determinant of the product is the product of the determinants, 
1.¢., 


det(A B) = det A - det B. 
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Definition 2.2 A transformation w : R? > R? is called a similitude if it is an 
affine transformation having one of the special forms 


- a \ fF S080. —7 sin 6 x] 4 e 
x.) \rsin@d rcoséd } \x yi 
o 2 rcos@- rsin@ x] 4 e 
x.) \rsin@d —rcosé) \ x2 


for some translation (e, f) € R?, some real number r #0, and some angle 0,0 < 
@ <2n. @ is called the rotation angle while r is called the scale factor or scaling. 
The linear transformation 


R x1\__ {rcosé —rsin@ x] 
°\ x) \rsin@ rcosé x9 


is a rotation. The linear transformation 
xX} 1 0 x} 
R a 
oa € af a 


Figure [11.43 shows some of the things a similitude can do. Notice that a simili- 
tude preserves angles. 


Examples & Exercises 


2.5. Find the scaling ratios r;, r2 and the rotation angles 6), 62 for the affine trans- 
formation that takes the triangle (0, 0), (0, 1), (1, 0) onto the straight-line segment 
from (1, 1) to (2, 2) in R2 in such a way that both (0, 1) and (1, 0) go to (1, 1). 


is a reflection. 


2.6. Let S be a region in R? bounded by a polygon or other “nice” boundary. Let 
w: R2 —> R?2 be an affine transformation, w(x) = Ax + t. Show that 


(area of w(S)) = | det A| - (area of S); 


see Figure III.44. Show that det A < 0 has the interpretation that S is “flipped over’ 
by the transformation. (Hint: suppose first that S is a triangle.) 


2.7. Show that if w : R? > R? is a similitude, w(x) = Ax + t, where t¢ is the trans- 
lation and A is a 2 x 2 matrix, then A can always be written either A =rRg or 
A= rRRo. 


2.8. View the railway tracks image in Figure III.45 as a subset S of R?. Find a 
similitude w : R? — R? such that w(S) c S, w(S) # S. 


2.9. We use the notation introduced in Definition 2.2. Find a nonzero real number 
r, an angle 6, and a translation vector t such that the similitude wx =r Rox + ft on 
R? obeys 

w(A)C A, with w(A) £ A, 


where A denotes a Sierpinski triangle with vertices at (0, 0), (1, 9), and G, 1}. 
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Figure 111.43. Some of 
the things that a similitude 
can do. 


‘ A Sy 
: SY, ke & 


<n ~s - a in nec ee 


2.10. Show that if w : R? > R? is affine, w(x) = Ax +1, then it can be reexpressed 


oe r\ 0 r3 0 Xi 
we = (4 ») (6 md An) tf 


where r; € R and 0 < 6 < 27. We call a transformation of the form 
o(2)=(8 2)(2) 
X2 0 ro X2 


2.11. Let S denote the two-dimensional orchard subset of R* shown in Fig- 
ure III.46. Find two fundamentally different affine transformations that map S into 
S but not onto S. Define the transformations by specifying how they act on three 
points. 


2.12. Show that if A is a2 x 2 matrix such that det A 4 0, with 


a b 
aol 2): 


then the inverse of A, denoted A~!, is given by 


d —b 
, 2s 1 = PG bl my: 
det A oe’ a aT 


a coordinate rescaling. 
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Figure |11.44. The scal- 


ing factor by which an FON 

affine transformation MEE a a 
changes area is deter- eo a AFFINE =~ 
mined by the determinant Lo SRP EO TRANSFORMATION 
of its linear part. ES W(x)=Axet 


Figure 111.45. Railway 
to infinity. Can you find an 
affine transformation that 
nearly maps the track ties 
into themselves? 


2 Affine Transformations in the Euclidean Plane 57 


ae oe 

‘ ry NOP la? Woy oe se 

12 Son Vey dy H aba Fy “f°. 
ay 


¥ ' 
rar SUEY, 


4 vay 
rene x 
a ae 
a as > NY 
oe el 


Figure II1.46. Orchard subset of R?. Can you find some interesting affine transformations 
that map this set into itself? 


2.13. The trace of a matrix A is the sum of the elements along the diagonal, that is 


trA= > aii. 


Let A be a2 x 2 matrix, and let B be another 2 x 2 matrix such that det B 4 0. Show 
that 


tr(BAB"')=trA 
and 


det(BAB~') = det A. 


2.14. Let w(x) = Ax denote a linear transformation in the metric space (R?, D) 
where 
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a=(¢ 2) 


Define the norm of a point x € R? to be |x| = D(x, O), where O denotes the origin. 
Define the norm of the linear transformation A by 


|A| = max | :xeE Rx 40| 
|x| 
when this maximum exists. Show that | A| is defined when D is the Euclidean metric 
and when it is the Manhattan metric. Find an expression for |A| in terms of a, b, c, 
and d in each case. Make a geometrical interpretation of |A|. Show that when |A| 


exists we have 


|Ax| < JA] - |x| for all x € R?. 


3 Md6dbius Transformations on the Riemann Sphere 


Definition 3.1 A transformation f : C — C defined by 
(az +b) 
(cz +d)’ 


where a,b,c, and d € C, ad — bc £0, is called a Mobius transformation on C. If 
c #0 then f(—d/c) = ~, and f (co) =a/c. Ifc =0, then f (oo) = oo. 


f@= 


As shown by the following exercises and examples, one can think of a Mobius 
transformation as follows. Map the whole plane C, together with the point at infin- 
ity, onto the sphere C, as described in Chapter II. A sequence of operations is then 
applied to the sphere. Each operation is elementary and has the property that it takes 
circles to circles. The possible operations are rotation about an axis, rescaling (uni- 
formly expand or contract the sphere), and translation (the whole sphere is picked 
up and moved to a new place on the plane, without rotation). Finally, the sphere is 
mapped back onto the plane in the usual way. Since the mappings back and forth 
from the plane to the sphere take straight lines and circles in the plane to circles on 
the sphere, we see that a Mobius transformation transforms the set of straight lines 
and circles in the plane onto itself. We also see that a Mobius transformation is in- 
vertible. It is wonderful how the quite complicated geometry of M6bius transforma- 
tions is handled by straightforward complex algebra, where we simply manipulate 
expressions of the form (az + b)/(cz +d). 


Examples & Exercises 


3.1. Show that the most general Mobius transformation, which maps oo to 00, is of 
the form f(z) =az+b,a,b¢€C,a #0, and that this is a similitude. Show that any 


3 Mobius Transformations on the Riemann Sphere 


sets in the plane are 
mapped onto the sphere 


Figure 111.47. A Mébius transformation acting on England to produce a new country. 


two-dimensional similitude that does not involve a reflection can be written in this 
form. That is, disregarding changes in notation, 
f(z) = f (x1 + ix) = (4, + ian) (x1 + 1x2) + (b1 + ib) 
= re’? (x; + ix.) + (b; + ib), G = /-1) 


_ {rcosé —r sin@ x1 ue b, 

~ \rsind rcosé x2 by } 
Find r and @ in terms of a; and a2. Show that the transformation can be achieved as 
illustrated in Figure III.48. 


3.2. Show that the Mobius transformation f(z) = 1/z corresponds to first mapping 
the plane to the sphere in such a way that the unit circle {z € C: |z| = 1} goes to 
the equator, followed by an inversion of the sphere (turn it upside down by rotating 


the sphere may be moved to 
a new location on the plane, 
rescaled, rotated, and inverted. 
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Figure 111.48. The x N 
mechanism of the simil- \ 
itude f(z) =re’z +b in 
terms of the sphere. 


(1)map him onto 
the sphere 


(2) spin the globe 
on its NS axis 
hrough angle e 


iecies the NS 
vector constant 
pick up and move 
the sphere, putting 
S down on b. 


3) rescale the globe 
N “until it is r times 


JS 
Ss ae 
/ 
(5) map him ae . 
the plane ~ - 


e) 
® 


about an axis through +1 and —1 on the equator), and finally mapping back to the 
plane. 


3.3. Show that any Mobius transformation that is not a similitude may be written 


pcan for some e, f,g €C, f £0. 
ie 


3.4. Sketch what happens to the picture in Figure III.49 under the Mobius transfor- 
mation f(z) = +. 


3.5. What happens to Figure III.49 under the Mobius transformation f(z) = 1 +iz? 


4 Analytic Transformations 


Figure IIl.49. Up the 
Garden Path. What does 
the Mobius transformation 
z+» 1+/iz do to this 
picture? 


3.6. Show algebraically that a MObius transformation f : C > Cis always invert- 
ible. 


3.7. Show that if f; and f> are MGbius transformations, then fj o f2 is a Mobius 
transformation. 


3.8. Find a Mobius transformation that takes the real line to the unit circle centered 
at the origin. 


3.9. Evaluate f°"(z) if. f(z) = 1/1 +z), n € {—2, —1,0, 1, 2, 3,...}. 


3.10. Interpret the Mébius transformation f(z) =i + 1/(z — i) in terms of opera- 
tions on the sphere. 
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4 Analytic Transformations 


In this section we continue the discussion of transformations on the metric spaces 
(C, Euclidean) and (C, Spherical). We introduce a generalization of the Mobius 
transformations, called analytic transformations. We concentrate on the behavior 
of quadratic transformations. It is recommended that during a first reading or first 
course the reader obtains a good mental picture of how the quadratic transformation 
acts on the sphere. The reader may then want to study this section more closely after 
reading about Julia sets in Chapter VII. 
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The similitude f : C > C defined by the formula f(z) = 3z +1 is an example 
of an analytic transformation. It maps circles to circles magnified by a factor of 
three. A disk with center at zp is taken to a disk with center at f(z) = 3z) + 1. The 
tranformation is continuous, and it maps open sets to open sets. Nowhere does it 
“fold back along the dotted line.” 

The similitude f : C + C defined by f(z) = 3+ 3i)z + (i — 2i) is similarly de- 
scribed. The circles and disks are now rotated by 45° in addition to being magnified 
and translated. 

Loosely a transformation on C is analytic if it is continuous and it locally “be- 
haves like” a similitude. If you take a very small region indeed (How small? Small 
enough! There is a smallness such that what is about to be said is true!) and you 
watch what the transformation does to that tiny region, you will typically find that 
it is magnified or shrunk, rotated, and translated, in almost exactly the same manner 
that some similitude would do the job. The similitude will always be of the special 
type discussed in exercise 3.1 above. 

We make this description more precise. Let us decide to look at what our transfor- 
mation does in the vicinity of a point Zo € C. Assume that Zo is not a critical point, 
defined below. Let T denote a tiny region, a disk for example, which contains the 
point zo. Let f (7) be its image under the transformation. Then one can rescale T by 
a factor that makes it roughly the size of the unit square, and one can rescale f(T) 
by the same factor. The assertion of the previous paragraph is that the action of the 
transformation, viewed as taking 7, rescaled, onto f (7), rescaled, can be described 
more accurately by a similitude. If you like, one could consider a picture P drawn 
in J and examine the transformed image f(P): if P and f(P) are rescaled by the 
same factor so that P is the size of the unit square, then {(P) looks more and more 
like a similitude applied to P. This description becomes more and more precise the 
tinier the region under discussion. 

Consider the quadratic transformation f : C — C defined by 


F(Z) =22 = (xy + ing)? = (x? — x2) + xp roi = fxr, x2) + fori, x2)i, 


where fi (x1, x2) = (x? — x?) is called the real part of f(z), and fo(x1, x2) = 2x)x2 
is called the imaginary part of f. Pictures of what this transformation does to some 
Sierpinski triangles in C are illustrated in Figure III.50. 

Two features are to be noticed. (I) Provided that we stay away from the origin, 
the transformation behaves locally like a similitude: for points z close to zp , f(z) is 
approximated by the similitude 


w(z)=az+b — where a = 2z and b = —z2. 


This fact shows up in Figure III.50: upon close examination (we suggest the use of a 
magnifying glass here) of the transformed Sierpinski triangles, one sees that they are 
built up out of small triangles whose shapes are only slightly different from that of 
their preimages. The only place where this is not true is at the forward image of the 
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Figure 111.50. Quadrat- 
Sierpinski triangle before ic transformations are 
————— described by showing how 
they act on a Sierpinski 
triangle. Use a magnifying 
glass to check that the 
transformations behave 
locally like similitudes. 


After z — 7 


origin, which is a critical point. (II) The transformation maps the space twice around 
the origin. 
One can track analytically what happens to the point 


z=Rcost+iksint, 


where R > 0. As the time parameter t goes from zero to 277, z moves anticlockwise 
once around the circle of radius R. The transformed point f(z) is given by 


Tue R* cos 2t +i R? sin 2t. 


As the time parameter t goes from 0 to 27, f(z) moves twice around the circle of 
radius R?. 

On the Riemann sphere the transformation z +> z” can be described as follows. 
Let us say that the Equator corresponds to the circle of unit radius in the plane, that 
the South Pole corresponds to the Origin, and that the North Pole corresponds to the 
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Point at Infinity. Then the transformation leaves both Poles fixed. The Line of Longi- 
tude L connecting the Poles, which corresponds to the positive real axis, is mapped 
into itself, and the Equator is mapped into itself. Here is what we must picture. First, 
points that lie above the Equator are moved closer to the North Pole; points that lie 
below the Equator are moved closer to the South Pole; and the Equator is not shifted. 
Second, the skin of the sphere is cut along the Line of Longitude L. One side of the 
cut is held fixed while the other side is pulled around the sphere (following the ter- 
minator when the Sun is high above the Equator), uniformly stretching the space, 
until the edge of the cut is back over L. The two lips of the the cut are rejoined. 
The sphere has been mapped twice over itself. The Poles are the critical points of the 
transformation; they are the points about which wrapping occurs. This description is 
illustrated in Figure III.51. 

The most general quadratic transformation on the sphere is expressible by a for- 
mula of the form f(z) = Az* + Bz +C, where A, B, and C are complex numbers. 


Figure 111.51. The ac- 
tion of the quadratic trans- 
formation z +> z” in terms 


(1) POINTS ABOVE 
of the sphere. THE EQUATOR MOVE 
CLOSER TO THE 
NORTH POLE: BELOW 


THEY MOVE SOUTH. 


(2)THE SPHERE IS CUT 
ALONG THE LINE OF 
LONGITUDE L. 


(3) ONE EDGE OF THE 
CUT IS PULLED RIGHT 
AROUND THE SPHERE. 
THE SPHERE IS 
COVERED TWICE. 
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One can show there is a change of coordinates, z +> 9(z), where @ is a similitude, 
such that f(z) becomes expressible in the special form f(z) =z?+C for some 
complex number C; see Exercise 5.20 in the following section. Hence the descrip- 
tion of the most general quadratic transformation on the sphere can be made in the 
same terms as above, except that at the end there is a translation by some constant 
amount C. This translation leaves the Point at Infinity fixed. 

The quadratic transformation f(z) = z* maps the punctured plane C onto it- 
self twice. Each point on z € C{0} has two preimages. Hence f : C > C is not 
an invertible transformation. In such situations we can define a set-valued inverse 
function. 


Definition 4.1 Ler f :C > C be an analytic transformation such that f(C) = 
C. Then the set-valued inverse of f is the mapping f—!: H(C) > H(C) defined by 


f(A) ={weC: f(w)eA} forall Ae H(X). 


In Figure 11.52 we illustrate the transformation f—' acting on the Space of Frac- 
tals, in the case of the quadratic transformation f(z) = 2’. 


Figure 111.52. The set 
valued inverse, f'!, of 
the quadratic transforma- 
tion f(z) =z, maps the 
Sierpinski triangle AOB 
into POQ U POQ. More 
generally, f—! maps the 
Space of Fractals into it- 
self. Look carefully at this 
image! Several important 
features of analytic trans- 
formations are illustrated 
here. 
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One can obtain explicit formulas for f —l(z) when f is a quadratic transfor- 
mation. For example for f(z) =z”, f-'(O) = O, f~'(00) = 00, and f~'(z) = 
{w1(z), w2(z)} for z € C \ {0, co}. Here wi (x; + ix2) = a(x), x2) + ib(x1, x2) , and 
w(x}, X2) = —a(X1, X2) — ib(x1, x2), where 


xe txZ+x1 


a(x,,Xx2) = 5 when x» > 0, 
ht +x3 +X 

a(x}, x2) =— > when x2 < 0, 

b(x), x2) _ 


Each of the two functions w)(z) and w2(z) is itself analytic on C \ {0, oo}. 
The following definition formalizes what is meant by an analytic transformation 
on the complex plane. We recommend further reading, for example [Rudin, 1966]. 


Definition 4.2 Let (C, d) denote the complex plane with the Euclidean metric. 
A transformation f : C — C is called analytic if for each zp € C there is a similitude 
of the form 


w(z) =az+b), for some pair of numbers a, b € C, 


such that d(f (z), w(z))/d(z, 2) > O.as z > %. The numbers a and b depend on 2. 
If, corresponding to a certain point z = c, we have a = 0, then c is called a critical 
point of the transformation, and f (c) is called a critical value. 


If the analytic transformation f(z) is a rational transformation, which means that 
it is expressible as a ratio of two polynomials in z, such as 


(i) f (z) = 1+ 2i +272? — 92’, 
1 
iif@=— 


oi l+z+27 

iii) f(z) = ———$; 
Gii) F@ = 
then the numbers a and b in the similitude w(z) in Definition 4.2 are given by the 
formulas 


a= f'(z) and b = f(z) — azo. 


The derivative f’(z) of the rational function f(z) can be calculated by treating z as 
though it were the real variable x and applying the standard differentiation rules of 
calculus. The critical points c € C are the solutions of the equation f’(c) = 0. 

For example, close enough to any point z € C such that f’(zo) #0, the cubic 
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transformation (i) is well described by the similitude 
w(z) = (542 — 2726)z + (1 + 2i — 272 + 1822). 
The finite critical points associated with (i) may be obtained by solving 
54c — 27c* =0 


and are accordingly c= 0+i0 and c = 2 +70. By making the change of coordinates 
z’ = 1/z (see section 5), one can also analyze the behavior near the point at infinity. 
It turns out that c = 00 is always a critical point for a polynomial transformation 
f(z) on C. The space is “wrapped” an integral number of times about the image 
of critical point. For example, the cubic transformation (i) wraps space twice about 
each of the points f(0 +70) = 1+ 2i, and f(2 +10) = 37 + 2i, and it wraps it three 
times about f (co) = oo. 


Examples & Exercises 


4.1. Sketch a globe representing C, including a subset that looks like Africa, and 
show what happens to the subset under the quadratic transformation f(z) = z?. 


4.2. Verify the following explicit formulas for f—'(z), corresponding to f(z) = 
z— 1: f-'(-1) = O; f-'(o0) = 00; and f~'(z) = {wi(z), we(z)} for z € C \ 
{—1, co}, where wi (x; + 1x2) = a(x, x2) + ib(X1, x2), and w2(x1, x2) = —a(x1, x2) 


— ib(x, X2). Here 
(dtm)? +23 +142 


a(x1,X%2) = when x2 > 0, 


z 
Jd+tuyr+ezt+1+x 
a(x}, x2) =— SE Sa ae when x» < 0, 
and 
J (+ x1)? +23 -1-x 
b(x1, x2) = . | ——————_""———__-. 


Z 
Both w,(z) and w2(z) are analytic on C \ {—1}. 


4.3. Locate the critical points and critical values of the quadratic transformation 
f@=2+4+1. : 


4.4. Draw a side view of a man with an arm stretched out in front of him, holding 
a knife. The blade should point down. Choose the origin of coordinates to be his 
navel. Draw another picture to explain how hara-kiri can be achieved by applying 
the inverse of the quadratic transformation f(z) = z? to your image. 
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4.5. Find a similitude that approximates the behavior of the given analytic transfor- 
mation in the vicinity of the given point: (a) f(z) = z* near z = 1; (b) f(z) =1/z 
near 2) = 1+; (c) f(z) = (z — 1)? near z = 1 —i. 


5 How to Change Coordinates 


In describing transformations on spaces we usually make use of an underlying coor- 
dinate system. Most spaces have a coordinate system by means of which the points 
in the space are located. This underlying coordinate system is implied by the specifi- 
cation of the space: for example, X = [1, 2] provides a collection of points together 
with the natural coordinate x restricted by | < x < 2. We can think of either the 
space, made of points x € X, or equivalently the system of coordinates. If the space 
X is R* or C, then the underlying coordinate system may be Cartesian coordinates. 
If X = C, then the coordinate system may be angular coordinates on the sphere. 

In each case the underlying coordinate system is itself a subset of a metric space. 
We denote this metric space by Xc. Usually we do not consciously distinguish 
between a point x € X and its coordinate x € Xc. Notice, however, that the space 
Xc may contain points (coordinates) that do not correspond to any point in the space 
X. For example, in the case of the space X = mit is natural to take Xc = R’; then 
points x € X in the space correspond to coordinates x = (x1, x2) € Xc restricted by 
0 < x; < 1 and 0 < x2 < 1. However, the coordinates (3, 5) € Xc do not correspond 
to a point in X. We would like the reader to think of the space itself as “lying above” 
its coordinate system, as suggested in Figure III.53. 

A change of coordinate system may be described by a transformation 6 : Xc > 
Xc. We can think of a change of coordinates being effected by physically moving 
each point x € X so that it no longer lies above x € Xc but instead above the coordi- 
nate x’ = 0(x) € Xc. Thus we must now distinguish between a point x lying in the 
space, X, from its coordinate x € Xc. Then we want to think of the change of coor- 
dinates 6 : Xc > Xc as moving X relative to the underlying coordinate space Xc, 
as illustrated in Figure ITI.54. 


Example 


5.1. Let X =[1, 2] and take Xc to be R. Let 0: R— R be defined by 0(x) = 
2x + 1. Then the coordinate of the point x = 1.5 becomes changed to 4. We want 
to think of the space X as being moved relative to the coordinate space Xc, which is 
held fixed , as illustrated in Figure III.55. 


Let 0 : Xc — Xc denote a change of coordinates. In order that the new coordinate 
system be useful, it is usually necessary that 0, treated as a transformation from X 
to 9(X), be one-to-one and onto, and hence invertible. Let f : X — X be a transfor- 
mation on a metric space X. We want to consider how the transformation f should 
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Figure [II.53. The un- 
derlying coordinate sys- 
tem Xc for the space X. 


Y Xe THE UNDERLYING COORDINATE 


be expressed after the change of coordinates. Let x denote simultaneously a point 
in X and the coordinates of that point. Let f(x) denote simultaneously the point to 
which x is transformed by f, and the coordinates of that point. Let x’ denote the 
point x € X in the new coordinate system. That is, x’ = 6(x) € Xc denotes the new 
coordinates of the point x. Let f’(x’) denote the same transformation f : X > X, 
but expressed in the new coordinate system. Then the relation between the two co- 
ordinate systems is expressed by the commutative diagram in Figure III.57, and is 
illustrated in Figure III.56. 


Theorem 5.1 Let X be a space and let Xc > X be a coordinate space for X. 
Let a change of coordinates be provided by a transformation 0 : Xc > Xc . Let 0 be 
invertible when treated as a transformation from X to 0(X). Let the coordinates of 
a point x € X be denoted by x before the change of coordinates, and by x’ after the 
change of coordinates, so that 


x =0(2). 
Let f :X — X be a transformation on the space X. Let x +> f (x) be the formula 


for f expressed in the original coordinates. Let x' +> f’(x’') be the formula for f 
expressed in the new coordinates. Then 
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Figure 111.54. A Fin a x 


change of coordinates 
in terms of X and Xc. 
We think of X as being 
removed relative to the 
underlying coordinate 
space Xc. 


f= of oO), 
£@)=O6.0f 007')(x’). 


Examples & Exercises 


5.2. Consider an affine transformation f(x) = ax +b,a4#0,a#1,a,beR. This 
has a fixed point x, € R defined by f(x) = xy. We find x7 = b/(1 — a). xf is 
clearly the interesting point in the action of an affine transformation on R. Accord- 
ingly let us change coordinates to move x to the origin: that is x’ = 0(x) =x — xy. 
What does f look like in this new coordinate system? 


fx) = 00 f 00") =00(' +x) =alx’ + x5) +b —xF; 


f'(x’) = ax’, which is simply a rescaling! Now using the first formula we get 
y & 


Figure II1.55. A x e(x) 

change of coordinates osc oes 

for the space [1, 2] given 2 GL ae aa eee 
by the transformation 

x = O(x) = 2x + 1. 0 ! 2 3 : 5 : 
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Figure 111.56. The 
transformation F acting 
on X is equivalent to F’ 
acting on 0(X). 


f(x) =ax — xp) + xz 


and 

f(x) =a"(x — Xp) +X¢ _for all n € {0, 1, 2,3,...}. 
We now see a new way of visualizing an affine transformation on R: for example, if 
a > 1, we see the image in Figure III.58. 
5.3. Show that for any affine transformation f(x) :R? — R* given by f(x) = 
Ax +t, with fixed point x, that the coordinate transformation 6(x) = x — x trans- 
forms the function f’(x’) = Ax’. 
5.4. Let X = [1, 2] and let a change of coordinates be defined by x’ = 2x — 1. Leta 


transformation f : X — X be defined by f(x) = (x — 1)? +1. Express f in the new 
coordinate system. 


Definition 5.1 Let f : X — X be a transformation on a metric space. A point 
x¢ € X such that f (x) =x is called a fixed point of the transformation. 


The fixed points of a transformation are very important. They tell us which parts 
of the space are pinned in place, not moved, by the transformation. The fixed points 
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of a transformation restrict the motion of the space under nonviolent, nonripping 
transformations of bounded deformation. 


Examples & Exercises 
5.5. Find the fixed points x; and x2 of the Mobius transformation 


(z + 2) 


be ees, 


on C. Make a change of coordinates so that x; becomes the origin and x2 becomes 
the point at infinity. Hence interpret the action of f(z) on the sphere in geometrical 
terms. 


5.6. Let W(x) = Ax +t where det A $ 0 is a two-dimensional affine transforma- 
tion acting on the space X = R’. Find the fixed point x. Change coordinates so 
that x becomes the origin of coordinates. Hence describe the action geometri- 
cally of a two-dimensional, nondegenerate affine transformation. What can happen 
if det A = 0? 


5.7. Suppose we can find a coordinate transformation BAB~' = D, where D is a 
diagonal matrix we denote by 


Figure 111.57. Com- 
mutative diagram for 
the coordinate change 
6; Xc — Xc. 


ORIGINAL 
COORDINATES 


NEW 
COORDINATES 


Figure 111.58. An affine Xs 
transformation on R. We 
see rescaling (magnifica- 
tion or diminution) cen- 
tered at the fixed point, 
together with a flip of 180° 
ifa <0. 
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“tie 3 
p=(% 0). 


Show that A, and A, satisfy the equation 


a(*-* A} bd sad: 


g h—-ix 
5.8. Analyze the behavior of the affine transformation w(z) = 7z + 1 on C near the 
point at infinity by making the change of coordinates h(z) = 1/z. 


=)? —trAd+det A =0. 


5.9. Two one-parameter families of transformations on R are Su = x? — uw and 
8,(x) = Ax(1 — x), where yw and A are real parameters. Find a change of coordinates 
and a function p = (A) so that 2 Q) (x’) = g,(x’) is valid for an appropriate interval 
on the A-axis. 

5.10. Find the real fixed points of g(x) = x? — 5 Analyze the behavior of g near 
each of its fixed points by changing coordinates so as to move first one then the other 
to the origin. Another method for looking at the behavior of g near a fixed point is 
to approximate g(x) by the first two terms of its Taylor series expansion about the 
fixed point. Compare these methods. 


5.11. Suppose that the 2 x 2 matrix 


(5 


satisfies the condition (tr A)? — 4det A > 0. Show that there is a B such that 
BAB ‘=D, 
where D is a diagonal matrix. Furthermore, show that one choice for B is given by 
ee 
p= A,—h 
( i 
da—h 
What do you think happens if (tr A)* — 4det A < 0? 
5.12. Let w : R* > R? denote the affine transformation 
a Xj — tr 2 x1 < 1 
TORI 3 bo) FF 
Make a change of coordinates so that the transformation is simply a coordinate 


rescaling. What are the rescaling factors? 


Definition 5.2 Let F denote a set of transformations on a metric space X. F 
is called a semigroup if f, g € F implies fog é€ F. F is called a group if it is a 
semigroup of invertible transformations, and f € F implies f— ¢€ F. 

We introduce this definition because we will use semigroups (and groups) of 


transformations both to characterize and to compute fractal subsets of X. However, 
we do not use any deep theorems from group theory. 
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Examples & Exercises 


5.13. Let f : X — X be a transformation on a metric space. Show that the set of 
transformations { f°” : n=0, 1, 2, 3, ...} forms a semigroup. 


5.14. A transformation T : £ — & on code space is defined by 

TD (Xa XG 0 aXaXs me XAKSXENGXG..- 
and is called a shift operator. Describe the semigroup of transformations {T° :n = 
0, 1,2,3,...}. What are the fixed points of T°? if the code space is built up from the 
two symbols {0, 1}? 
5.15. Show that the set of Mébius transformations on R forms a group. 
5.16. Show that the set of Mébius transformations on C forms a group. 


5.17. Show that the set of invertible affine transformations on R? forms a group. 


5.18. Show that the set of transformations f : R? > R? such that f(A) C A forms 
a semigroup. 


5.19. Show that a group of transformations is provided by the set of affine trans- 
formations of the form w(x) = Ax +t, where A = » 7 for a,b,c € R, with 
ac # 0, and the translation vector ¢ is arbitrary. 


5.20. The most general analytic quadratic transformation f : C > C can be ex- 
pressed by a formula of the form f(z) = Az* + Bz+C, where A, B, and C are 
complex numbers, and A # 0. Show that by means of a suitable change of coordi- 
nates, z’ = 0(z), where @ is a similitude, show that f(z) can be reexpressed as a 
quadratic transformation of the special form f’(z) = (z')2 + C for some complex 
number C. 


6 The Contraction Mapping Theorem 


Definition 6.1 A transformation f :X — X ona metric space (X, d) is called 
contractive or a contraction mapping if there is a constant 0 < s < 1 such that 


d( f(x), f(y)) <s- d(x, y)Vx, y eX. 
Any such number s is called a contractivity factor for f. 


It would be convenient to be able to talk about the largest number and the smallest 
number in a set of real numbers. However, a set such as S = (—o0, 3) does not 
possess either. This difficulty is overcome by the following definition. 


Definition 6.2 Let S denote a set of real numbers. Then the infimum of S is 
equal to —oo if S contains negative numbers of arbitrarily large magnitude. Other- 
wise the infimum of S = max{x € R: x <s forall s € S}. The infinum of S always 
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exists because of the nature of the real number system, and it is denoted by inf S. 
The supremum of S is similarly defined. It is equal to +00 if S contains arbitrarily 
large numbers; otherwise it is the minimum of the set of numbers that are greater 
than or equal to all of the numbers in S. The supremum of S always exists, and it is 
denoted by sup S. 


Examples & Exercises 


6.1. Find the supremum and the infimum of the following sets of real numbers: (a) 
(—oo, 3); (b) C, the Classical Cantor Set; (c) {1, 2,3, 4, ...}; (d) the positive real 
numbers. 


6.2. Let f : X — X be a contraction mapping on a compact metric space (X, d). 
Show that inf{s € R : s is a contractivity factor for f} is a contractivity factor for f. 


6.3. Show that if f : X — X and g : X —> X are contraction mappings on a space 
(X, d), with contractivity factors s and t, respectively, then f o g is a contraction 
mapping with contractivity factor st. 


Theorem 6.1 /[(The Contraction Mapping Theorem).] Let f :X — X be acon- 
traction mapping on a complete metric space (X,d). Then f possesses exactly one 
fixed point xs € X and moreover for any point x € X, the sequence {f"(x):n= 
0, 1,2, ...} converges to x ¢. That is, 


lim f°"(x) =x, for each x € X. 
n->©o 


Figure III.59 illustrates the idea of a contractive transformation on a compact 
metric space. 


Proof Let x € X. Let 0 <s < 1 be acontractivity factor for f. Then 
aS Wit OO) S83" "dG fe) (1) 


for all m,n =O, 1,2,..., where we have fixed x € X. The notation u A v denotes 
the minimum of the pair of real numbers u and v. In particular, fork =0,1,2,..., 
we have 


d(x, f*(x)) < d(x, f(®)) + FO), f°) +--+. d( f°), FH) 
<(l+sts?+---+s*")d(x, f(x) 
< (1—s)"'d(x, f(x)), 
so substituting into equation (1) we now obtain 
dfx), FO) sis" As rae @, £@)., 
from which it immediately follows that { f°"(x)}?°, is a Cauchy sequence. Since X 


is complete this Cauchy sequence possesses a limit x  € X, and we have 


lim f°"(x) = ay. 
n— oo 
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Figure 111.59. (a) Illus- 
trates the idea of a con- 
tractive transformation on 
a metric space: (b) A con- 
traction mapping doing 
its work, drawing all of a 
compact metric space X 
toward the fixed point. 


(a) 


(b) 


Now we shall show that x; is a fixed point of f. Since f is contractive it is continu- 
ous and hence 


f (xp) = flim, f°" (x) = lim fF) = xy. 


Finally, can there be more than one fixed point? Suppose there are. Let x and yy be 
two fixed points of f. Thenx, = f(xy), yp = f(ys), and 


a(xs, yf) =A(f (xz), f(xp)) < sd(xz, yp), 
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where (1 — s)d(x+, yr) < 0, which implies d(x, y¢) = 0 and hence x = yy. This 
completes the proof. 


Examples & Exercises 


6.4. Let w(x) = Ax +t be an affine transformation in two dimensions. Make the 
change of coordinates h(x) = x’ = x — xr, under the assumption that det(J — A) 4 
QO, and show that w’(x’)=howoh!(x’) = Ax’, that w(x) = (h7! ow’ oh)(x) = 
A(x — x¢) +x, and hence that 


w(x) = A"(x —x¢) + x¢ forn = 0; t2- 3... (2) 


Give conditions on A such that it is contractive (a) in the Euclidean metric, and (b) in 
the Manhattan metric. Show that if |A| < 1, where |A| denotes any appropriate norm 
of A viewed as a linear operator on a two-dimensional vector space, then {w°"(x)} 
is a Cauchy sequence that converges to x +, for each x € R?. 


6.5. Let f :m— m@ be a contraction mapping on (@, Euclidean). Show that Fig- 
ure III.59 gives the right idea. 


6.6. Let f:R— R be the affine transformation f(x) = 3x + 4. Verify f is a 
contraction mapping and deduce 


lim f(x) =X, for each x € R. 
n—>0oO 


Use this formula with x = 1 to obtain a geometrical series for the fixed point x + € R. 
Observe, however, f (IR) = R; indeed f is invertible. 


6.7. Let (X, d) be a compact metric space that contains more than one point. Show 
that the situation in exercise 6.6 cannot occur for any contraction mapping f : XK > 
X. That is, show that f(X) C X but f(X) # X. That is, show that a contraction 
mapping on a nontrivial compact metric space is not invertible. Hint: use the com- 
pactness of the space to show that there is a point in the space that is farthest away 
from the fixed point. Then show that there is a point that is not in f (X). 


6.8. Show that the set of contraction mappings on a metric space forms a semi- 
group. 
6.9. Show that the affine transformation w : A > A defined by w(x) = Ax + tisa 
contraction, where 

1 re) as ° 1 

scos120° —-=sin 120 = 

AGA S ‘ dis. 4]. 
(in 120° 3 cos 120° ) = (3) 

Here A is an equilateral Sierpinski triangle with a vertex at the origin and one at 
(1, 0). You need to begin by verifying that w does indeed map A into itself! Locate 
the fixed point xs. Make a picture of this contraction mapping “doing its work, 
mapping all of the compact metric space A toward the fixed point.” Use different 
colors to denote the successive regions f° (A) \ f°+)(A) for n =0, 1, 2, 3,.... 
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Figure 111.60. The ex- 
istence of a positive 
eigenvalue of an “angle- 
squeezing” linear transfor- 
mation. 


ene 


VA 


Wu 
1 MTT 


6.10. Define a mapping on the code space of two symbols {0, 1} by f (x1x2x3x4...) 
= 1x1x2x3x4.... (Recall that the metric is d(x, y) = some aa or equivalent.) 
Show that f is a contraction mapping. Locate the fixed point of /f. 

6.11. Let (X,d) be a compact metric space, and let f : X — X be a contraction 
mapping. Show that { f°”(X)}°) is a Cauchy sequence of points in (H(X), h) and 
limyn—+>oo f°" (X) = {xf}, where x is the fixed point of f. 

6.12. Let (X,d) be a compact metric space. Let f : X + X have the property 
limn—oo f°" (KX) = xy. Find a metric d on X such that f is a contraction mapping, 
and the identity is a homeomorphism from (X, d) > (X, d). 


6.13. Let Ax =(° ? 
C<¢@ 


ear transformation on R*. Show that A maps the positive quadrant {(x1, x2) : x1 = 
0, x2 > 0} into itself. Let a mapping f : [0, 90°] — [0, 90°] be defined by 


cos? \ _ = cos f (0) 
A ‘- 9 ) = (some positive number) ( i (0) ) 


Show that { f°"(@)} converges to the unique fixed point of f. Deduce that there 


) (21) with 0, ¢,¢ © Rall strictly positive, be a lin 
< 


exists a unique positive number A, and an angle 0 < 6 < 90° such that A [= } a 


A _™ 3 ) See Figure III.60. 
sin 6 
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7 Contraction Mappings on the Space of Fractals 


Let (X, d) be a metric space and let (71(X), h(d)) denote the corresponding space 
of nonempty compact subsets, with the Hausdorff metric h(d). We introduce the 
notation h(d) to show that d is the underlying metric for the Hausdorff metric h. 
For example, we may discuss (H(C), h(spherical)) or (H(R*), h(Manhattan)). We 
will drop this additional notation when we evaluate Hausdorff distances. 

We have repeatedly refused to define fractals: we have agreed that they are sub- 
sets of simple geometrical spaces, such as (R*, Euclidean) and (C, Spherical). If 
we were to define a deterministic fractal, we might say that it is a fixed point of a 
contractive transformation on (H{(X), h(d)). We would require that the underlying 
metric space (X, d) be “geometrically simple.” We would require also that the con- 
traction mapping be constructed from simple, easily specified, contraction mappings 
on (X, d), as described below. 


Lemma 7.1 Let w:X— X be a contraction mapping on the metric space 
(X, d). Then w is continuous. 


Proof Let « > 0 be given. Let s > 0 be a contractivity factor for w. Then 
d(w(x), w(y)) <sd(x, y) <€ 
whenever d(x, y) < 6, where 5 = €/s. This completes the proof. 
Lemma 7.2 Let w:X — X be a continuous mapping on the metric space 
(X, d). Then w maps H(X) into itself. 


Proof Let S be a nonempty compact subset of X. Then clearly w(S) = {w(x): 
x € S} is nonempty. We want to show that w($) is compact. Let {y, = w(x,)} be 
an infinite sequence of points in S. Then {x,} is an infinite sequence of points in S. 
Since S is compact there is a subsequence {xy,} that converges to a point x € S. But 
then the continuity of w implies that {yy, = f(xy,)} is a subsequence of {y,} that 
converges to y = f(x) € w(S). This completes the proof. 


The following lemma tells us how to make a contraction mapping on (H((X), h) 
out of a contraction mapping on (X, d). 


Lemma 7.3 Let w:X— X be a contraction mapping on the metric space 
(X, d) with contractivity factor s. Then w : H(X) — H(X) defined by 


w(B) = {w(x):x € B}VB € H(X) 
is a contraction mapping on (H(X), h(d)) with contractivity factor s. 


Proof From Lemma 7.1 it follows that w : X — X is continuous. Hence by 
Lemma 7.2 w maps #{(X) into itself. Now let B, C € H(X). Then 
d(w(B), w(C)) = max{min{d(w(x, y), w(y)): y EC}: x € B} 
< max{min{s -d(x, y):yEC}:x € B)=s-d(B,C). 
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Similarly, d(w(C), w(B)) <s-d(C, B). Hence 
h(w(B), w(C)) = d(w(B), w(C)) Vv d(w(C), w(B)) < s- d(B, C) Vv d(C, B) 
<s-d(B,C). 
This completes the proof. 


The following lemma gives a characteristic property of the Hausdorff metric 
which we will shortly need. The proof follows at once from exercise 6.13 of Chapter 
Il. 


Lemma 7.4 For all B,C, D, and E, in H(X) 
h(B UC, DUE) <h(B, D) VAC, E), 
where as usual h is the Hausdorff metric. 


The next lemma provides an important method for combining contraction map- 
pings on (H(X),/) to produce new contraction mappings on (7{(X), h). This 
method is distinct from the obvious one of composition. 


Lemma 7.5 Let (X, d) be a metric space. Let {wy :n =1,2,..., N} be con- 
traction mappings on (H(X), h). Let the contractivity factor for w, be denoted by Sp, 
for each n. Define W : H(X) > H(X%) by 

W(B) = w,(B) U w2(B) U...U w,(B) 
= UP _,wn(B), for each B € H(X). 


Then W is a contraction mapping with contractivity factor s = max{s, :n = 1, 2, 
wee ee 


Proof We demonstrate the claim for N = 2. An inductive argument then com- 
pletes the proof. Let B, C € H(X). We have 


h(W(B), W(C)) = h(wi(B) U wo(B), wi(C) U w2(C)) 
< h(w;(B), w1(C)) V h(w2(B), w2(C)) (by Lemma 7.2) 
< s5,h(B,C) V s2h(B, C) < sh(B,C). 
This completes the proof. 


Definition 7.1 A (hyperbolic) iterated function system consists of a complete 
metric space (X,d) together with a finite set of contraction mappings Wy, : X —> 
X, with respective contractivity factors s,, forn=1,2,...,.N. The abbreviation 
“IFS” is used for “iterated function system.’ The notation for the IFS just an- 
nounced is {X; w,,n = 1, 2,..., N} and its contractivity factor is s = max{s,:n = 
12,5555 007. 


We put the word “hyperbolic” in parentheses in this definition because it is some- 
times dropped in practice. Moreover, we will sometimes use the nomenclature “IFS” 
to mean simply a finite set of maps acting on a metric space, with no particular con- 
ditions imposed upon the maps. 

The following theorem summarizes the main facts so far about a hyperbolic IFS. 
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Theorem 7.1 Let {X; w,,n =1,2,...,.N} be a hyperbolic iterated function 
system with contractivity factor s. Then the transformation W :H(X) — H(X) de- 
fined by 


W(B) = Un: Wn(B) 


for all B € H(X), is a contraction mapping on the complete metric space (H(X), 
h(d)) with contractivity factor s. That is 


h(W(B), W(C)) <s-h(B,C) 
for all B, C € H(X). Its unique fixed point, A € H(X), obeys 
A = W(A) = U"_,w,(A) 
and is given by A = liMn+o W"(B) for any B € H(X). 


Definition 7.2 The fixed point A € H(X) described in the theorem is called the 
attractor of the IFS. 


Sometimes we will use the name “attractor” in connection with an IFS that is 
simply a finite set of maps acting on a complete metric space X. By this we mean 
that one can make an assertion analagous to the last sentence of Theorem 7.1. 

We wanted to use the words “deterministic fractal” in place of “attractor” in 
Definition 7.2. We were tempted, but resisted. The nomenclature “iterated function 
system” is meant to remind one of the name “dynamical system.” We will introduce 
dynamical systems in Chapter 4. Dynamical systems often possess attractors, and 
when these are interesting to look at they are called strange attractors. 


Examples & Exercises 


7.1. This exercise takes place in the metric spaces (R, Euclidean) and (H(R), 
h(Euclidean)). Consider the IFS {R; w;, w2}, where w)(x) = 5x and w2(x) = zx = 
- Show that this is indeed an IFS with contractivity factor s = ‘. Let Bo = [0, 1]. 
Calculate B, = W°"(Bo),n = 1, 2, 3,.... Deduce that A = lim,_,.. B, is the classi- 
cal Cantor set. Verify directly that A = 4A U {4A + 3}. Here we use the following 
notation: for a subset A of R,xA = {xy: ye A}andA+x={y+x: yeEA}. 


7.2. With reference to example 7.1, show that if w(x) =s5,x and w2(x) = (1 — 
51)x + 51, where s; is anumber such that 0 < s; < 1, then B; = B, = B3=.... Find 
the attractor. 


7.3. Repeat example 7.1 with w)(x) = 4x and w2(x) = 5x + 5. In this case A = 
lim, oo B, will not be the classical Cantor set, but it will be something like it. 
Describe A. Show that A contains no intervals. How many points does A contain? 


7.4. Consider the IFS {R; qx +- 3, 5x, ix es i}. Verify that the attractor looks like 
the image in Figure III.61. Show, precisely, how the set in Figure IIT.61 is a union of 
three “shrunken copies of itself.” This attractor is interesting: it contains countably 
many holes and countably many intervals. 
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Figure IlI.61. Attractor 
for three affine maps on 
the real line. Can you find 


the maps? 
| 
Figure 111.62. A se- \ 
quence of sets converging 
to a line segment. 
1 
Ao 
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7.5. Show that the attractor of an IFS having the form {R; w;(x) = ax +b, w2(x) = 
cx + d}, where a, b,c, and d € R, is either connected or totally disconnected. 


7.6. Does there exist an IFS of three affine maps in R? whose attractor is the union 
of two disjoint closed intervals? 


7.7. Consider the IFS 


mG G)*()-(0 YG} 


Let Ao = {(5, y):0 < y < 1}, and let W°"(Ao) = An, where W is defined on (R72) 
in the usual way. Show that the attractor is A = {(x, y):x = y,0 <x < 1} and 
that Figure III.62 is correct. Draw a sequence of pictures to show what happens if 
Ao = {(x, y) € R?:0<x<1,0<y< I}. 


7.8. Consider the attractor for the IFS {R; w(x) = 0, w2(x) = ox a +}. Show that 
it consists of a countable increasing sequence of real points {x,:n =0,1,2,...} 
together with {1}. Show that x, can be expressed as the nth partial sum of an infinite 
geometric series. Give a succinct formula for x,,. 


7.9. Describe the attractor A for the IFS {[0, 2]; wi(x) = 3x7, w2(x) = 2x + 4} by 
describing a sequence of sets which converges to it. Show that A is totally discon- 
nected. Show that A is perfect. Find the contractivity factor for the IFS. 


7.10. Let (7,6),0 <r <0, 0 <@ < 2m denote the polar coordinates of a point 
in the plane, R?. Define w)(r, 0) = (57 + 5, 50), and wa(r, 0) = (Sr + 4, 460 + 
an), Show that {R?; w,, w2} is not a hyperbolic IFS because both maps w, and w2 
are discontinuous on the whole plane. Show that {R*; w;, w2} nevertheless has an 
attractor; find it (just consider r and @ separately). 


7.11. Show that the sequence of sets illustrated in Figure III.63 can be written in 
the form A, = W°"(Ao) forn = 1,2,..., and find W : H(R?) > H(R?). 


7.12. Describe the collection of functions that constitutes the attractor A for the IFS 
1 1 
{C[O, 1]; wi(f(x)) = 51), w2(f (x)) = 51%) + oat = 4), 
Find the contractivity factor for the IFS. 


7.13. Let C°[0, 1] = {f €¢ C[0,1]: f() = f(1) = 0}, and define d(f, g) = 
max{| f(x) — g(x)|: x € [0, 1]}. Define w, : C°[0, 1] > C°[0, 1] by (wi(f))(x) = 
+ f (2x mod 1) + 2x(1 — x) and (w2(f))(x) = +f (x). Show that {C°[0, 1]; wi, we} 
is an IFS, find its contractivity factor, and find its attractor. Draw a picture of the 
attractor. 


7.14. Find conditions such that the M6bius transformation w(x) = (ax + b)/(cz + 
d),a,b,c,d € C,ad — bc £0, provides a contraction mapping on the unit disk 


Figure [II.63. The first 


three sets Ap, A,;, and A, 
in a convergent sequence 
of sets in H(R*). Can 
you find a transformation 
W : H(R?) > H(R?) 
such that A,.,; = W(A,)? 
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X = {z € C: |z| < 1}. Find an upper bound for the contractivity factor. Construct an 
IFS using two Mobius transformations on X, and describe its attractor. 


7.15. Show that a Mobius transformation on C is never a contraction in the spheri- 
cal metric. 


7.16. Let (2X, d) be the code space of three symbols {0, 1, 2}, with metric 


ie, @) 
Xn = Yn 
d(x, y) = 2 pia 
Define w; : & — &X by w(x) = 0x) x2x3... and w2(x) = 2x)x2x3.... Show that w 
and wz are both contraction mappings and find their contractivity factors. Describe 
the attractor of the IFS {2; w), w2}. What happens if we include in the IFS a third 


transformation defined by w3x = 1x1x2x3...? 


7.17. Let A C R? denote the compact metric space constisting of an equilateral 
Sierpinski triangle with vertices at (0, 0), (1, 0), and G, a), and consider the IFS 
{A, 52+ 5, 507724 +} where we use complex number notation. Let Ao = A, 
and A, = W™(Ao) for n = 1, 2, 3,.... Describe A;, A2, and the attractor A. What 
happens if the third transformation w3(z) = 5Z + + + (/3/4)i is included in the 
IFS? 


8 Two Algorithms for Computing Fractals from iterated Function Systems 


In this section we take time out from the mathematical development to provide two 
algorithms for rendering pictures of attractors of an IFS on the graphics display de- 
vice of a microcomputer or workstation. The reader should establish a computer- 
graphical environment that includes one or both of the software tools suggested in 
this section. 

The algorithms presented are (1) the Deterministic Algorithm and (2) the Random 
Iteration Algorithm. The Deterministic Algorithm is based on the idea of directly 
computing a sequence of sets {A, = W°"(A)} starting from an initial set Ao. The 
Random Iteration Algorithm is founded in ergodic theory; its mathematical basis 
will be presented in Chapter [X. An intuitive explanation of why it works is pre- 
sented in Chapter IV. We defer important questions concerning discretization and 
accuracy. Such questions are considered to some extent in later chapters. 

For simplicity we restrict attention to hyperbolic IFS of the form {R?; w,:n = 
1,2,..., N}, where each mapping is an affine transformation. We illustrate the al- 
gorithms for an IFS whose attractor is a Sierpinski triangle. Here’s an example of 


such an IFS: 
. cE 635 =:0 x1 A l 
: X2 =~ 0 0.5 X2 as 
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“Lel=Lo aslla]*[so) 
“Lal=[o as]la]*[s] 


This notation for an IFS of affine maps is cumbersome. Let us agree to write 


Pe ee X1 <2 ae 
woem[]-[2 2] Le ]-Lg]eaees 


Then Table III.1 is a tidier way of conveying the same iterated function system. 

Table III.1 also provides a number p; associated with w; for i = 1, 2,3. These 
numbers are in fact probabilities. In the more general case of the IFS {X; w, :n = 
1,2,..., N}, there would be N such numbers {p; :i = 1, 2,..., N} that obey 


Pit Po2t+ p3+:-+>+ pn =1 and p; > 0 fon =e. NN 


These probabilities play an important role in the computation of images of the at- 
tractor of an IFS using the Random Iteration Algorithm. They play no role in the 
Deterministic Algorithm. Their mathematical significance is discussed in later chap- 
ters. For the moment we will use them only as a computational aid, in connection 
with the Random Iteration Algorithm. To this end we take their values to be given 
approximately by : 
|detA;| —_lajd; — Dc; | 

Die (Ail = Lies lad: — bees 
Here the symbol © means “approximately equal to.” If, for some i, det A; = 0, 
then p; should be assigned a small positive number, such as 0.001. Other situations 


should be treated empirically. We refer to the data in Table III.1 as an IFS code. 
Other IFS codes are given in Tables III.2, III.3, and III.4. 


Aigorithm 8.1 The Deterministic Algorithm. Let {X; w,, w2,..., wv} bea 
hyperbolic IFS. Choose a compact set Ag C R*. Then compute successively A, = 
W°"(A) according to 


An+1 = Ul_,w;(An) forn = ae eo 


fOr P= LZ: 2. 


i 


Thus construct a sequence {A, :n = 0, 1, 2,3, ...} C H(X). Then by Theorem 7.1 
the sequence {A,} converges to the attractor of the IFS in the Hausdorff metric. 


Table Ill.1. IFS code for a Sierpinski triangle. 


wy € 9 € d es. p 

i US 0 © -8s 1 1. 033 
2 05 =-0- 6---05 1<- 30° =0.33 
2.05 U0 03 3). Sos 
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Table I{11.2. IFS code for a square. 


“eee er ee ee i p 

I OS. 8 Gs f 0.25 
2 0S 0 0 O35: 48 1 0.25 
a. DS. 2 ae | 30 0.25 
+ Sf © 8 30 ee ZS 


Table 111.3. IFS code for a fern. 


w a b Cc d e a Pp 

oe, 0 0 0.16. 0:0 0.01 
2 085 0.04 -0.04 085 0O 16 °&# 0.85 
a D2 O26 O27 C2 0 16°" Oi 
4 -0.15 0.28 0.26 024 0 044 0.07 


Table 111.4. IFS code for a fractal tree. 


a 0 0 0.5 4 0.05 
2 042 -042 042 042 0 02 0.4 
3 042 042 -042 042 0 02 04 
4 0.1 0 0 0.1 0.02 -045 


We illustrate the implementation of the algorithm. The following program com- 
putes and plots successive sets A,+, starting from an initial set Ao, in this case a 
square, using the IFS code in Table III.1. The program is written in BASIC. It should 
run without modification on an IBM PC with Color Graphics Adaptor or Enhanced 
Graphics Adaptor, and Turbobasic. It can be modified to run on any personal com- 
puter with graphics display capability. On any line the words preceded by a ’ are 
comments and not part of the program. 


Program 1. (Example of the Deterministic Algorithm) 

screen 1 : cls ’initialize graphics 
dim s(100,100) : dim t(100,100) ’allocate two arrays of pixels 
a(1)=0.5:b(1)=0:c(1)=0:d(1)=0.5:e(1)=1:£(1)=1 ’input the IFS code 
a(2)=0.5:b(2)=0:c(2)=0:d(2)=0.5:e(2)=50:f (2) =1 
a(3)=0.5:b(3)=0:c(3)=0:d(3)=0.5:e(3)=25:f (3) =50 
for i=1 to 100 ’input the initial set A(0), in this case 

a square, into the array t(i,j) 
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t(i,1)=1: pset(i,i1) ’A(O) can be used as a condensation set 

t(1,i)=1:pset(1,i) ’A(O) is plotted on the screen 

t(100,i)=1:pset (100, i) 

t(i,100)=1:pset (i, 100) 

next: do 

for i=1 to 100 ’apply W to set A(n) to make A(nti) in the 

array s(i,j) 

for j=1 to 100 : if t(i,j)=1 then 

s(a(1)*itb(1)*jt+e(1) ,c(1)*it+td(1)*j+f(1))=1 ’and apply W to A(n) 

s(a(2)*i+b(2)*jt+e(2) ,c(2)*i+d(2)*j+f(2))=1 

s(a(3) *i+b(3) *j+e(3) ,c(3) *itd(3) *j+f (3) )=1 

end if: next j: next i 

cls ’clears the screen--omit to obtain sequence with a A(0) as 
condensation set (see section 9 in Chapter II) 

for i=1 to 100 : for j=1 to 100 

t(i,j)=s(i,j) ’put A(nt+1) into the array t(i,j) 

s(i,j)=0 ’reset the array s(i,j) to zero 

if t(i,j)=1 then 

pset(i,j) ’plot A(n+t1) 

end if : next : next 

loop until instat ’if a key has been pressed then stop, 
otherwise compute A(n+1)=W(A(n+1)) 


The result of running a higher-resolution version of this program on a Masscomp 
5600 workstation and then printing the contents of the graphics screen is presented 
in Figure III.64. In this case we have kept each successive image produced by the 
program. 

Notice that the program begins by drawing a box in the array ¢(i, j). This box has 
no influence on the finally computed image of a Sierpinski triangle. One could just 
as well have started from any other (nonempty) set of points in the array ¢t(i, j), as 
illustrated in Figure III.65. 

To adapt Program 1 so that it runs with other IFS codes will usually require 
changing coordinates to ensure that each of the transformations of the IFS maps the 
pixel array s(i, j) into itself. Change of coordinates in an IFS is discussed in exercise 
10.14. As it stands in Program 1, the array s(i, j) is a discretized representation of 
the square in R? with lower left corner at (1, 1) and upper right corner at (100, 100). 
Failure to adjust coordinates correctly will lead to unpredictable and exciting results! 


Algorithm 8.2 The Random Iteration Algorithm. Let {X; w;, w2,..., wv} 
be a hyperbolic IFS, where probability p; > 0 has been assigned to to w; for i = 
1,2,..., N, where eee Pj = 1. Choose xo € X and then choose recursively, inde- 


pendently, 


Xn € {wi(Xn-1), w2(Xn-1), eae) WN(Xn-1)} for n ares l, ei nF J | 
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where the probability of the event x, = w;(Xn—1) is p;. Thus, construct a sequence 
jin =O, 1, 2.552 


x The reader should skip the rest of this paragraph and come back to it after 
reading Section 9. If {X, wo, wi, w2,..., wy} is an IFS with condensation map wo 
and associated condensation set C C H(X), then the algorithm is modified by (a) 
attaching a probability po > 0 to wo, so now sear pi = 1; (b) whenever wo(x,,_1) 
is selected for some n, choose x, “at random” from C. Thus, in this case too, we 
construct a sequence {x, :n =0, 1, 2,...} of points in X. 

The sequence {x,,}P° , “converges to” the attractor of the IFS, under various con- 
ditions, in a manner that will be made precise in Chapter IX. 

We illustrate the implementation of the algorithm. The following program com- 
putes and plots a thousand points on the attractor corresponding to the IFS code in 
Table III.1. The program is written in BASIC. It runs without modification on an 


ec aa a i ly a, ee eet ee Be 2S 
Figure 111.64. The re- 

sult of running the De- 

terministic Algorithm 

(Program 1) with various 

values of N, for the IFS 

code in Table III.1. 


ue 


sl: 
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IBM PC with Enhanced Graphics Adaptor and Turbobasic. On any line the words 
preceded by a’ are comments: they are not part of the program. 


Program 2. (Example of the Random Iteration Algorithm) 
?Iterated Function System Data 


afi] = 0.5 : b[i] =0 : cf{i] =0 : d{i] =.5 : e[{1i] =1 : £[1] =1 
a(2] = 0.5 : b[2] =0 : c[2] =0 : d{2] =.5 : e(2] =50 : £[2] =1 
a({3] = 0.5 : b[3] =0 : c[3] =0: d({3] =.5 : e[3] =50 : £[3] =50 


screen 1 : cls ’initialize computer graphics 
window (0,0)-(100,100) ’set plotting window to 0<x<100, 0<y<100 
x =0 : y = O: numits =1000 ’initialize (x,y) and define 

the number of iterations, numits 


Figure 111.65. The re- 
sult of running the Deter- 
ministic Algorithm (Pro- 
gram 1), again for the IFS 
code in Table III.1, but 
starting from a different 
initial array. The final re- 
sult is always the same! 
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Figure 111.66. The result of running the Random Iteration Algorithm for increasing 
numbers of iterations. The randomly dancing point starts to suggest the structure of the 
attractor of the IFS given in Table III.3. 


for n =i to numits ’Random Iteration begins! 
k = int(3*rnd-0.00001) +1 ’choose one of the numbers 1, 2, 
and 3 with equal probability 


,apply affine transformation number k to (x,y) 
newx =a(k]*x+b[k]*yt+te[k] : newy =c[k]*x+d[k] *y+f [k] 


X =newx : y =newy ’set (x,y) to the point thus obtained 

if n > 10 then pset (x,y) ’plot (x,y) after the first 10 
iterations 

next : end 


The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure III.66. 
Notice that if the size of the plotting window is decreased, for example by replacing 
the window call by WINDOW (0,0)-(50,50), then only a portion of the image is 
plotted, but at a higher resolution. Thus we have a simple means for “zooming in” 
on images of IFS attractors. The number of iterations may be increased to improve 
the quality of the computed image. 


Examples & Exercises 


8.1. Rewrite Programs 1 and 2 in a form suitable for your own computer environ- 
ment, then run them and obtain hardcopy of the output. Compare their performance. 


8.2. Modify Programs 1 and 2 so that they will compute images associated with the 
IFS code given in Table III.2. 
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8.3. Modify Program 2 so that it will compute images associated with the IFS codes 
given in Tables III.3 and III.4. 


8.4. By changing the window size in Program 2, obtain images of “zooms” on the 
Sierpinski triangle. For example, use the following windows: (1, 1) — (50, 50); C1, 
1) — (25, 25); (1, 1) — (12, 12); ...; (1, 1) — (NV, N). How must the total number of 
iterations be adjusted as a function of N in order that (approximately) the number 
of points that land within the window remains constant? Make a graph of the total 
number of iterations against the window size. 


8.5. What should happen, theoretically, to the sequence of images computed by 
Program 1 if the set Ag is changed? What happens in practice? Make a computa- 
tional experiment to see if there is any difference in say Ajo corresponding to two 
different choices for Ao. : 


8.6. Rewrite Program 2 so that it applies the transformation w; with probabil- 
ity p;, where the probabilities are input by the user. Compare the number of it- 
erations needed to produce a “good” rendering of the Sierpinski triangle, for the 
cases (a) p; = 0.33, p2 = 0.33, p3 = 0.34; (b) p; = 0.2, po = 0.46, p3 = 0.34; (c) 
r= 0.1, P2= 0.56, p3= 0.34. 


en  ———KV 


9 Condensation Sets 


There is another important way of making contraction mappings on 7{(X). 


Definition 9.1 Let (X,d) be a metric space and let C € H(X). Define a trans- 
formation wo : H(X) > H(X) by wo(B) = C for all Be H(X). Then wo is called a 
condensation transformation and C is called the associated condensation set. 


Observe that a condensation transformation wo : H(X) — H(X) is a contraction 
mapping on the metric space (H1(X), h(d)), with contractivity factor equal to zero, 
and that it possesses a unique fixed point, namely the condensation set. 


Definition 9.2 Let {X; wi, w2,..., Wn} be a hyperbolic IFS with contractivity 
factor 0 <s <1. Let wo: H(X) > H(X) be a condensation transformation. Then 
{X; wo, Wi, ..., Wn} is called a hyperbolic IFS with condensation, with contractivity 
factor s. 


Theorem 7.1 can be modified to cover the case of an IFS with condensation. 


Theorem 9.1 Let {X; w, :n =0,1,2,..., N} be a hyperbolic iterated function 
system with condensation, with contractivity factor s. Then the transformation W : 
H(X) — H(X) defined by 


W(B) = U"_)w,(B)VB € H(X) 
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Figure Ill.67. A geo- 
metric series of pine trees, 
the attractor of an IFS with 
condensation. 


ss 
= 


is a contraction mapping on the complete metric space (H(X), h(d)) with contrac- 
tivity factor s. That is 


h(W(B),W(C)) <s- h(B, C)VB,C € H(X). 
Its unique fixed point, A € H(X), obeys 
A = W(A) =U" _jwn(A) 
and is given by A =limy_..0 W°"(B) for any B € H(X). 
Examples & Exercises 


9.1. A sequence of sets {A, C X}®,, where (X, d) is a metric space, is said to be 
increasing if Ag C Aj C A2 C --- and decreasing if Ag > A; > A> D-:«. The in- 
clusions are not necessarily strict. A decreasing sequence of sets {A, C H(X)}% 


n=0 
is a Cauchy sequence (prove it!). If X is compact then an increasing sequence of 
sets {A, C H(X)}"°) is a Cauchy sequence (prove it!). Let {X; wo, wi,..., Wn} be 


a hyperbolic IFS with condensation set C, and let X be compact. Let Wo(B) = 
Ur _oWn(B)VB € H(X) and let W(B) = Ui_jUn(B). Define {C, = Wo" (C)} re 5. 
Then Theorem 9.1 tells us {C,,} is a Cauchy sequence in #((X) that converges to 
the attractor of the IFS. Independently of the theorem observe that 


C= CUWIO UW ...UW"(C) 


provides an increasing sequence of compact sets. It follows immediately that the 
limit set A obeys Wo(A) = A. 


9.2. This example takes place in (R*, Euclidean). Let C = = Ao C R? 


denote a set that looks like a scorched pine tree standing at the origin, with its trunk 
perpendicular to the x-axis. Let 
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0 (5)=(°o° os) (5)+(°e"): 


Show that {R?; wo, w;} is an IFS with condensation and find its contractivity factor. 
Let A, = W°"(Ao) for n = 1,2,3,..., where W(B) =U>_)w,(B) for B € H(R?). 
Show that A, consists of the first (n + 1) pine trees reading from left to right in 
Figure III.67. If the first tree required 0.1% of the ink in the artist’s pen to draw, and 
if the artist had been very meticulous in drawing the whole attractor correctly, find 
the total amount of ink used to draw the whole attractor. 


9.3. What happens to the trees in Figure III.67 if w; (* ) is replaced by 
= rae Ee eo 
eT Ve” OTS FS 0 


9.4. Find the attractor for the IFS with condensation {R*; wo, wi}, where the 
condensation set is the interval [0, 1] and w;(x) = $x + 2. What happens if w;(x) = 
1 


ax? 
5x: 


9.5. Find an IFS with condensation that generates the treelike set in Figure III.68. 
Give conditions on r and 6 such that the tree is simply connected. Show that the tree 
is either simply connected or infinitely connected. 


in exercise 9.2? 


9.6. Find an IFS with condensation that generates Figure III.69. 


9.7. You are given a condensation map w(x) in R* that provides the largest tree 
g Pp p g 


Figure 111.68. Sketch 
of a fractal tree, the at- 
tractor of an IFS with 
condensation. 
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Figure 11.69. An end- 
less spiral of little men. 


in Figure III.46. Find a hyperbolic IFS with condensation, of the form {R7; wo, wy, 
w2}, which produces the whole orchard. What is the contractivity factor for this IFS? 
Find the attractor of the IFS {R?; w), wo}. 


9.8. Explain why removing the command that clears the screen (“cls”) from Pro- 
gram 1 will result in the computation of an image associated with an IFS with con- 
densation. Identify the condensation set. Run your version of Program 1 with the 
“cls” command removed. 


10 How to Make Fractal Models with the Help of the Collage Theorem 


The following theorem is central to the design of IFS’s whose attractors are close to 
given sets. 


Theorem 10.1 (The Collage Theorem, (Barnsley 1985b)). Ler (X, d) be 
a complete metric space. Let L € H(X) be given, and let € > 0 be given. Choose an 
IFS (or IFS with condensation) {X; (wo), W1, W2,..., Wn} with contractivity factor 
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0 <s < 1,50 that 
h(L,U",., Wa(L)) <€, 
(n=0) 
where h(d) is the Hausdorff metric. Then 


h(L, A) < e/(1 aoe. S), 
where A is the attractor of the IFS. Equivalently, 


h(L, A) <(1—s)'A(L,U",_, wa(L)) —_—for all L € H(X). 
(n=0) 

The proof of the Collage Theorem is given in the next section. The theorem 
tells us that to find an IFS whose attractor is “close to” or “looks like” a given 
set, one must endeavor to find a set of transformations—contraction mappings on a 
suitable space within which the given set lies—such that the union, or collage, of the 
images of the given set under the transformations is near to the given set. Nearness 
is measured using the Hausdorff metric. 


Examples & Exercises 


10.1. This example takes place in (R, Euclidean). Observe that [0, 1] = [0, 5] U 
[5 1]. Hence the attractor is [0, 1] for any pair of contraction mappings w,;: R—> R 
and w2: R —> R such that w),((0, 1]) = [0, 5] and w2([{0, 1]) = [5, 1]. For example, 
w(x) = 5x and w2(x) = 5x + : does the trick. The unit interval is a collage of two 
smaller “copies” of itself. 


10.2. Suppose we are using a trial-and-error procedure to adjust the coefficients 
in two affine transformations w,(x) = ax + b, w2(x) =cx +d, where a,b,c,dé€ 
R, to look for an IFS {R; w,, w2} whose attractor is [0, 1]. We might come up 
with w;(x) = 0.51x — 0.01 and w2(x) = 0.47x + 0.53. How far from [0, 1] will the 
attractor for the IFS be? To find out compute 


h ([0, 1], Ux, wi (9, 1])) = h([0, 1], [-0.0/, 0.5] U (0,53, 1]) = 0.015 


and observe that the contractivity factor of the IFS is s = 0.51. So by the Collage 
Theorem, if A is the attractor, 


h({0, 1], A) < 0.015/0.49 < 0.04. 


10.3. Figure III.70 shows a target set L C R?, a leaf, represented by the polygonal- 
ized boundary of the leaf. Four affine transformations, contractive, have been applied 
to the boundary at lower left, producing the four smaller deformed leaf boundaries. 
The Hausdorff distance between the union of the four copies and the original is ap- 
proximately 1.0 units, where the width of the whole frame is taken to be 10 units. 
The contractivity of the associated IFS {R?; w, w2, w3, w4} is approximately 0.6. 
Hence the Hausdorff distance h(Euclidean) between the original target leaf L and 
the attractor A of the IFS will be less than 2.5 units. (This is not promising much!) 
The actual attractor, translated to the right, is shown at lower right. Not surprisingly, 
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Figure III.70. The Col- 
lage Theorem applied to 
a region bounded by a 
polygonalized leaf bound- 


ary. 


Figure 111.71. The re- 
gion bounded by a right- 
angle triangle is the union 
of the results of two simil- 
itudes applied to it. 


it does not look much like the original leaf! An improved collage is shown at the 
upper left. The distance h(L, Lf ail (L)) is now less than 0.02 units, while the con- 
tractivity of the IFS is still approximately 0.6. Hence h(L, A) should now be less 
than 0.05 units, and we expect that the attractor should look quite like L at the reso- 
lution of the figure. A, translated to the right, is shown at the upper right. 


10.4. To find an IFS whose attractor is a region bounded by a right-angle triangle, 
observe the collage in Figure III.71. 
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Collage Theorem to help 

you find an IFS consisting 

1 6) _ of two affine maps in R? 

i ; | | whose attractor is close to 
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10.5. A nice proof of Pythagoras’ Theorem is obtained from the collage in Fig- 
ure III.71. Clearly both transformations involved are similitudes. The contractivity 
factors of these similitudes involved are (b/c) and (a/c). Hence the area A obeys 
A= (b/c)*A + (a/c)*A. This implies c? = a? + b* since A > 0. 


10.6. Figures III.72-III.76 provide exercises in the application of the Collage The- 
orem. Condensation sets are not allowed in working these examples! 


10.7. It is straightforward to see how the Collage Theorem gives us sets of maps for 


IFS’s that generate A. A Menger Sponge looks like this: qa = iy. Find an IFS 


for which it is the attractor. 


10.8. The IFS that generates the Black Spleenwort fern, shown in Figure III.77, 
consists of four affine maps in the form 


x rcos@ —ssin@ x A : 
ws )= (ceo scos@ ) (2) +f) @= 123.4% 
see Table ITI.5. 


10.9. Find a collage of affine transformations in R?, corresponding to Figure III.78. 
10.10. A collage of a leaf is shown in Figure III.79 (a). This collage implies the IFS 


{C; w1, w2, w3, w4} where, in complex notation, 
w;(z) = siz + (1 — 5;)a; fori = 1, 2, 3, 4. 
Verify that in this formula a; is the fixed point of the transformation. The values 


found for s; and a; are listed in Table III.6. Check that these make sense in relation 
to the collage. The attractor for the IFS is shown in Figure III.79 (b). 


Sa ) 9g _ Figure Ill.72. Use the 
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Figure III.73. This image represents the attractor of 14 affine transformations in R?. Use 
the Collage Theorem to help you find them. 


Figure Ill.74. Use the 
Collage Theorem to help 
find a hyperbolic IFS of 
the form {R?; w;, w2, w3}, 
where w}), W2, and w3 are 
similitudes in R*, whose 
attractor is represented 
here. You choose the 
coordinate system. 


10 How to Make Fractal Models with the Help of the Collage Theorem 


10.11. The attractor in Figure III.80 is determined by two affine maps. Locate the 
fixed points of two such affine transformations on R?. 


10.12. Figure III.81 shows the attractor for an IFS {R?; w;,i = 1,2, 3,4} where 
each w; is a three-dimensional affine transformation. See also Color Plate 3. The 
attractor is contained in the region {(x;, x2, x3) € R°?: -10 < x; < 10,0 <x < 
10, —10 < x3 < 10}. 
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10.13. Find an IFS of similitudes in R? such that the attractor is represented by the 
shaded region in Figure III.82. The collage should be “just-touching,” by which we 
mean that the transforms of the region provide a tiling of the region: they should fit 
together like the pieces of a jigsaw puzzle. 


10.14. This exercise suggests how to change the coordinates of an IFS. Let {X;, d,} 
and {X>, d2} be metric spaces. Let {X 1; w;, w2,..., wy} be a hyperbolic IFS with 
attractor A,. Let 0 : X; — X, be an invertible continuous transformation. Consider 
the IFS {X»;80w,007!,00w2007!,...,9 0 wy 067}. Use 6 to define a metric 
on X> such that the new IFS is indeed a hyperbolic IFS. Prove that if Az € H(X>) is 


Table Ill.5. The IFS code for the Black Spleenwort, expressed in scale and angle formats. 


Translations Rotations Scalings 
Map h k 0 1) r s 

0.0 0.0 0 0 0.0 0.16 

0.0 1.6 2.5 <2). U.es U6 


0.0 -1.6 49 49 0.3 0.34 
00 044 120 -50 O03. °,Ual 


> WN — 
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Figure 111.75. Find an IFS of the form {R’; w;, w2, w3, wa}, where the w;’s are affine 
transformations on R’, whose attractor when rendered contains this image. Check your 
conclusion using Program 2. 


the attractor of the new IFS, then A7 = 0(A,). Thus we can readily construct an IFS 
whose attractor is a transform of the attractor of another IFS. 


10.15. Find some of the affine transformations used in the design of the fractal 
scene in Figure III.83. 


10.16. Use the Collage Theorem to find an IFS whose attractor approximates the 
set in Figure III.84. 
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10.17. Solve the problems proposed in the captions of (a) Figure III.85, (b) Fig- 
ure III.86, (c) Figure III.87. 


11 Blowing in the Wind: The Continuous Dependence 
of Fractals on Parameters 


The Collage Theorem provides a way of approaching the inverse problem: given a 
set L, find an IFS for which L is the attractor. The underlying mathematical principle 


Figure II1.76. How 
many affine transforma- 
tions in R* are needed 
to generate this attractor? 
You do not need to use a 
condensation set. 
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Figure 11.77. The 
Black Spleenwort fern. 
The top image illustrates 
one of the four affine 
transformations in the IFS 
whose attractor was used 
to render the fern. The 
transformation takes the 
triangle ABC to triangle 
abc. The Collage Theorem 
provides the other three 
transformations. The IFS 
coded for this image is 
given in Table III.3. Ob- 
serve that the stem is the 
image of the whole set un- 
der one of the transforma- 
tions. Determine to which 
map number in Table III.3 
the stem corresponds.The 
bottom image shows the 
Black Spleenwort fern and 
a close-up. 


is very easy: the proof of the Collage Theorem is just the proof of the following 
lemma. 


Lemma 11.1 Let (X, d) be a complete metric space. Let f : X —> X be a con- 
traction mapping with contractivity factor 0 < s < 1, and let the fixed point of f be 
x € X. Then 


d(x, xs) <(1—s)'- d(x, f(x)) for all x € X. 
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Figure Ill.78. Use the 
Collage Theorem to find 
the four affine transfor- 
mations corresponding to 
this image. Can you find 
a transformation which 
will put in the “missing 
comer”? 
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Proof The distance function d(a, b), for fixed a € X, is continuous in b € X. 
Hence 


d(x,x,)=d (x, lim fom(x)) = lim d(x, f°") 


(a) Collage 


< lim Daf"), FOP) 
m=1 


< lim d(, f@)d+s+---+s8"")<(1—s)"!d(x, f(x)). 


This completes the proof. 


The following results are important and closely related to the above material. 
They establish the continuous dependence of the attractor of a hyperbolic IFS on 
parameters in the maps that constitute the IFS. 


(b) Attractor 


Figure IlI.79. A col- 
lage of a leaf is obtained 


Table Ill.6. Scaling factors and fixed points for the collage in Figure III.79. ae 
nee centr een using four similitudes, 


K) a as illustrated in (a). The 
0.6 0.45 + 0.91 corresponding IFS is pre- 
0.6 0.45 + 0.31 sented in complex nota- 


tion in Table III.6. The 
attractor of the IFS is ren- 
dered in (b). 


0.4-0.31 0.60 + 0.31 
0.4+0.31 0.30+0.3i 


104 Chapter Ill Transformations on Metric Spaces; Contraction Mappings 


Lemma 11.2 Let (P,d,) and (X,d) be metric spaces, the latter being com- 
plete. Let w: P x X > X be a family of contraction mappings on X with contrac- 
tivity factor 0 <s < 1. That is, for each p € P, w(p, -) is a contraction mapping on 
X. For each fixed x € X let w be continuous on P.. Then the fixed point of w depends 
continuously on p. That is, x ¢ : P — X is continuous. 


Proof Let x;(p) denote the fixed point of w for fixed p € P. Let pe P and 
€ > 0 be given. Then for all g € P, 


d(x ¢(p), x¢(q)) =d(w(p, x¢(p)), w(q, x¢(q))) 
<d(w(p, x7(p)), w(q, xf(P))) 
+ d(w(q, x¢(p)), w(q, x¢(q))) 
<d(w(p, xs(p)), wg, x¢(p))) + sd(xz¢(p), X¢(Q)), 
which implies 
d(x (p), x¢(q)) < (1 — s)"'d(w(p, xf(p)), w(q, xf(p)))- 


The right-hand side here can be made arbitrarily small by restricting qg to be suffi- 
ciently close to p. (Notice that if there is a real constant C such that 


d(w(p, x), w(q, x)) < Cd(p, q) for all p, g € P, for all x € X, 
then d(x (p), x¢(q)) < (1—s)7'- C - d(p,q), which is a useful estimate.) This 
completes the proof. 
Examples & Exercises 


11.1. The fixed point of the contraction mapping w : R — R defined by w(x) = 
5x + p depends continuously on the real parameter p. Indeed, x = 2p. 


Figure 111.80. Locate yA 
the fixed points of a pair 


of affine transformations a: 
in R? whose attractor is 2 
rendered here. = 


11 Blowing in the Wind 


Figure Il1.81. Single 
three-dimensional fern. 
The attractor of an IFS of 
affine maps in R?. 


Figure I11.82. Find a 
“‘just-touching” collage of 
the area under this Devil’s 
Staircase. 
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Figure I11.83. Determine some of the affine transformations used in the design of this 
fractal scene. For example, where do the dark sides of the largest mountain come from? 


11.2. Show that the fixed function for the transformation w : C°[0, 1] > C°[0, 1] 
defined by w(f (x)) = pf (2xmod1) + x(1 — x) is continuous in p for p € (—1, 1). 
Here, C°[0, 1] = {f € C[0, 1]: f(0) = f(1) = 0} and the distance is d(f, g) = 
max{| f(x) — e(x)| . x € 10, EI}. 


In order for this to be of use to us, we need some method of moving the con- 
tinuous dependence on the parameter p to H(X). We cannot do this just because the 
image of a point in some set B depends continuously on p, since, although this gives 
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11 Blowing in the Wind 


Figure IlI.84. “Typi- 
cal” fractals are not pretty: 
use the Collage Theorem 
to find an IFS whose at- 
tractor approximates this 
set. 


Figure 111.85. Deter- 
mine the affine transfor- 
mations for an IFS corre- 
sponding to this fractal. 


_ Can you see, just by look- 


ing at the picture, if the 
linear part of any of the 
transformations has a neg- 
ative determinant? 
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Figure IlI.86. Use the 
Collage Theorem to ana- 
lyze this fractal. On how 
many different scales is 
the whole image appar- 
ently repeated here? How 
many times is the smallest 
clearly discernible copy 
repeated? 


us a 5 to constrain p with in order that w(p, x) moves by less than e¢, this relation 
is still dependent on the point (p, x). A set B € H(X), which is interesting, contains 
an infinite number of such points, giving us no 6 greater than 0 to constrain p with to 
limit the change in the whole set. We can get such a condition by further restricting 
w(p, x). Many constraints will do this; we pick one that is simple to understand. For 
our IFS, parametrized by p € P, that is {X:w,,..., wn,}, we want the conditions 
under which given € > 0, we can find a 6 > 0 such that 


dp(p,q) <5 => h(w,(B), wg(B)) <€. 


11 Blowing inthe Wind 109 


Figure 111.87. Consider 
the white areas in this fig- 
ure to represent a set S in 
R?. Locate the boundary 
of the largest pathwise- 
connected subset of S. It 
is recommended that you 
work with a photocopy 
of the image, a magnify- 
ing glass, and a fine red 
felt-tip pen. 
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Suppose that for every p € P, w;,(x) is a continuous function on X. Furthermore, 
we ask that there is a k > 0, independent of x and p such that for each fixed x € X 
and for each w;,, the condition 


d(w;,(x), wi,(x)) <k-dp(p, q) 


holds. This condition is called Lipshitz continuity. It is not the most general con- 
dition to prove what we need; we really only need some continuous function of 
d(p,q) which is independent of x on the right-hand side. We choose Lipshitz con- 
tinuity here because for the maps we are interested in, it is the easiest condition to 
check. If we can show that for any set B € H((X) we have 


h(w;,(B), wi,(B)) <k + dp(p, 9), 


then we can easily get the condition we want from the Collage Theorem. Proving 
this is simply a matter of writing down the definitions for the metric h. 


h(w,(B), wg(B)) = d(w,(B), wq(B)) V d(w4(B), wp(B)), 
where 
d(w,(B), wq(B)) = 2s Wq(B))) 
d(x, wg(B)) = min (d(x, y)). 
yew, (B) 


Now, x € w,(B) implies that there is an x € B such that x = w,(x). Then there is a 
point w,(x) € wg(B), which is the image of x under wz. For this point, our condition 
holds, and 


d(x, Wg(x)) <k-dy(p,q) => een Aes y)) < d(x, wg(X)) < k- d,(p, gq) 


Since this condition holds, for every x € wp(B) the maximum over these points is at 
most k - d,(p, q), and we have 


d(wp(B), wq(B)) <k-dp(p, q). 
The argument is nearly identical for d(w,(B), w,(B)), so we have 


h(wp(B), wg(B)) <k - dp(p, 9), 


and a small change in the parameter on a particular map produces a small change 
in the image of any set B € H(X). For a finite set of maps,w),,...,Wy,, and 
their corresponding constants k,,...,ky, it is then certainly the case that if k = 
maxj=1,...n (ki), we have 


h(w;,(B), wi,(B)) <k + dp(p, 4). 


Now the union of such image sets cannot vary from parameter to parameter by more 
than the maximum Hausdorff distance above, consequently, 
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h(W,(B), Wo(B)) <k - dp(p, q). 


We now apply the results of Lemma 11.2 to the complete metric space H(X), 
yielding 


h(Ap, Aq) < (1—8)"'h(Ap, Wa(Ap)) < (1 — 5)7"k - dp(p, q). 


Theorem 11.1 Let (X, d) be a complete metric space. Let {X; w ,..., wy} be 
a hyperbolic IFS with contractivity s. Forn =1,2,..., N, let w, depend on the 
parameter p € (P,d,) subject to the condition d(wn,(x), Wn, (x)) <k- dp(p, q) for 
all x € X with k independent of n, p, or x. Then the attractor A(p) € H(X) depends 
continuously on the parameter p € P with respect to the Hausdorff metric h(d). 


In other words, small changes in the parameters will lead to small changes in the 
attractor, provided that the system remains hyperbolic. This is very important be- 
cause it tells us that we can continuously control the attractor of an IFS by adjusting 
parameters in the transformations, as is done in image compression applications. It 
also means we can smoothly interpolate between attractors: this is useful for image 
animation, for example. 


Examples & Exercises 


11.3. Construct a one-parameter family of IFS, of the form {R*; wy, w2, w3}, 
where each w; is affine and the parameter p lies in the interval [0, 24]. The attrac- 
tor should tell the time, as illustrated in Figure III.88. A(p) denotes the attractor at 
time p. 

11.4. Imagine a slightly more complicated clockface, generated by using a one- 
parameter family of IFS of the form {R*; wo, w1, w2, w3}, p € [0, 24]. wo creates 
the clockface, w; and wz are as in Exercise 11.3, and w3 is a similitude that places 
a copy of the clockface at the end of the hour hand, as illustrated in Figure III.89. 
Then as p goes from 0 to 12 the hour hand sweeps through 360°, the hour hand on 
the smaller clockface sweeps through 720°, and the hour hand on the yet smaller 
clockface sweeps through 1080°, and so on. Thus as p advances, there exist lines 
on the attractor which are rotating at arbitrarily great speeds. Nonetheless we have 
continuous dependence of the image on p in the Hausdorff metric! At what times do 
all of the hour hands point in the same direction? 


11.5. Find a one-parameter family of IFS in R*, whose attractors include the three 
trees in Figure IIT.90. 


11.6. Run your version of Program 1 or Program 2, making small changes in the 
IFS code. Convince yourself that resulting rendered images “vary continuously” 
with respect to these changes. 

11.7. Solve the following problems with regard to the images (a)-(f) in Fig- 


ure III.91. Recall that a “just-touching” collage in R? is one where the transforms 
of the target set do not overlap. They fit together like the pieces of a jigsaw puzzle. 
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Figure IlI.88. A one- 
parameter family of IFS 
that tells the time! 


Figure 11.89. This 
fractal clockface depends 
continuously on time in 
the Hausdorff metric. 
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Figure 111.90. Blowing 
in the wind. Find a one- 
parameter family of IFS 
whose attractors include 
the trees shown here. 
The Random Iteration 
Algorithm was used to 
compute these images. 
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(a) Find a one-parameter family collage of affine transformations. 

(b) Find a “just-touching” collage of affine transformations. 

(c) Find a collage using similitudes only. What is the smallest number of affine 
transformations in R?, such that the boundary is the attractor? 

(d) Find a one-parameter family collage of affine transformations. 

(e) Find a “just-touching” collage, using similitudes only, parameterized by the 
real number p. 

(f) Find a collage for circles and disks. 
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Figure 111.91. Classical 
collages. Can you find 
an IFS corresponding to 
each of these classical 
geometrical objects? 


(2 P, P) 


Chapter IV 


Chaotic Dynamics 
on Fractals 


1 The Addresses of Points on Fractals 


We begin by considering informally the concept of the addresses of points on the 
attractor of a hyperbolic IFS. Figure IV.92 shows the attractor of the IFS: 


{C; wi(z) = (0.13 + 0.64i)z, w2(z) = (0.13 + 0.64i)z + 1}. 


This attractor, A, is the union of two disjoint sets, w;(A) and w2(A), lying to the 
left and right, respectively, of the dotted line ab. In turn, each of these two sets is 
made of two disjoint sets: 


w1(A) = wi(wi(A)) U w2(w)(A)), w2(A) = w2(w2(A)) U w2(w2(A)). 


This leads to the idea of addressing points in terms of the sequences of transfor- 
mations, applied to A, which lead to them. All points belonging to A, in the subset 
Ww 1(w,(A)), are situated on the piece of the attractor that lies below dc and to the left 
of ab, and their addresses all begin 11... .. Clearly, the more precisely we specify ge- 
ometrically where a point in A lies, the more bits to the address we can provide. For 
example, every point to the right of ab, below ef, to the left of gh, has an address 
that begins 212 .... In Theorem 2.1 we prove that, in examples such as this one, it is 
possible to assign a unique address to every point of A. In such cases we say that the 
IFS is “totally disconnected.” 

Here is a different type of example. Consider the IFS 


{C; w)(z) = 52 w2(z) = = =} . w3(z) = 2 = si} 

The attractor, A, of this IFS is a Sierpinski triangle with vertices at (0, 0), (1, 0), 
and (0, 1). Again we can address points on A according to the sequences of trans- 
formations that lead to them. This time there are at least three points in A that 
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Figure IV.92. Ad- 
dresses of points on an 


attractor. The lines ab, cd, 


ef, and gh are not part of 
the attractor. 


Every point in this part Addresses begin 
#22 of town has an address 22 


2 ® te 
se sat begins 12... git gets 
a 


ais 38 


Addresses begin Addresses begin 
His, a 21... 


have two addresses, because there is a point in each of the sets w;(A)M w2(A), 
w2(A) N w3(A), and w3(A) M w)(A) , as illustrated in Figure IV.93. 

On the other hand, some points on the Sierpinski triangle have only one address, 
such as the three vertices (0, 0), (1,0), and (0, 1). Although the attractor is con- 
nected, the proportion of points with multiple addresses is “small,” in a sense we 
do not yet make precise. In such cases as this we say that the IFS is “just-touching.” 
Notice that this terminology refers to the IFS itself rather than to its attractor. 

Let us look at a third, fundamentally different example. Consider the hyperbolic 
IFS 

1-73 


1 
0. f—<. =x). 
{[ aoe wee 


The attractor is A = [0, 1], but now 
w1(A) NM w2(A) = [0 s7e 1] (2 = 
Ww ay = 4? Lea i 
é yg 4°2 


SO wi(A) M w2(A) is a significant piece of the attractor. The attractor would look 
very different if the overlapping piece [Z. s] were missing. Now observe that every 
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Figure 1V.93. Some 
points on this Sierpin- 
ski triangle have two 
addresses, while others 
have only one address. 
Overlining on the last 
symbols, in an expres- 
sion such as 31111, means 
that the overlined symbols 
are repeated endlessly. 
For example, 31111 = 


31111111111111111 
1111..., and 31123 = 
3112520123125 ..: 
33333 
eee NS 
31111 SNS 
13333 23333 
Find the 
address 
x MS . NN | | 
4 a » b - . 
WNNS WN NN NANNING NS 
11111 NNNNNNNNANSN 22279 


point in (5, =] has at least two addresses. On the other hand, the points 0 and 1 have 
only one address each. Nonetheless, it appears that the proportion of points with 
multiple addresses is large. In such cases we say that the IFS is “overlapping.” 

The terminologies “totally disconnected,” “just-touching,” and “overlapping” re- 
fer to the IFS itself rather than to the attractor. The reason for this is that the same 
set may be the attractor of several different hyperbolic IFS’s. Consider, for example, 
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Figure IV.94. Differ- 
ent IFS’s with the same 
attractor provide differ- 
ent addressing schemes. 
Here the symbols {0, 1} 
are used in place of {1, 2} 
for obvious reasons. 


Binary addressing of the Alternative addresses of the 
interval [0,1] induced by the interval [0,1] induced by the 
IFS {[0,1],0.5x,0.5x+0.5} IFS {[0,1],0.5x,-0.5x+1} 


the two IFS’s 
1 1 1 
(10, 1I;'geit) = 55, wie) = 5% + 5} 
and 
l 1 
{[0, 1]; wie) = 5x, wo(x) = —5x + 1). 


The attractor of each one is the real interval [0, 1]. We can obtain two different 
addressing schemes for the points in [0, 1], as illustrated in Figure IV.94. 
These two IFS are just-touching. However, the IFS 


{{0, 1]; wi(x) = sh w(x) = * + ) 


is overlapping, while its attractor is also [0, 1]. 


Examples & Exercises 


1.1. Figure IV.95 shows the attractor of an IFS of the form {R?; w,,n = 1, 2, 3}, 
where each of the transformations w,, : R* > R? is affine. The addresses of several 
points are given. Find the addresses of a, b, and c. 


1.2. In Figure IV.95 locate the point whose address is 11 111. 


1.3. A quadtree is an addressing scheme used in computer science for addressing 
small squares in the unit square m = {(x;, x2) € R?: 0 < x; < 1,0 < x2 < 1} as fol- 
lows. The square is broken into four quarters. Points in the first quarter have ad- 
dresses that begin 0, points in the second quarter have addresses that begin 1, and 
so on, as illustrated in Figure IV.96. Find an IFS that gives rise to the addressing 
scheme suggested in Figure IV.96. Is this a totally disconnected, just-touching, or 
overlapping IFS? 


1.4. Addresses are assigned to the Sierpinski triangle, as in Figure I'V.93. Character- 
ize the addresses of the set of points that lie on the outermost boundary, the triangle 
with vertices Ti 22. and 33. 
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1995 3333 ~~ 


1.5. Characterize the addresses of points belonging to the boundary of the largest 
hole in Figure IV.97. 


1.6. Consider a hyperbolic IFS with condensation set C. Suppose the condensation 
set is itself the attractor of another hyperbolic IFS. Design an addressing scheme for 
the attractor of the IFS with condensation. Can all possible addresses occur? 


1.7. Figure IV.98 shows an “overlapping” IFS attractor, for two affine transforma- 
tions in R?. Choose one point in each of the marked regions on the attractor. Find the 
first four numbers in two different addresses for each of these points. The first few 
numbers in the addresses of some points on the attractor are included in the figure to 
remove possible ambiguities. 


1.8. x Identify the set of addresses of points on the attractor, A, of a hyperbolic 


IFS with code space. Argue that nearby codes correspond to points on A which are 


nearby. 


1.9. Address the real number 0.7513 in each of the two coding schemes given in 
Figure IV.94. 


In thinking about the addresses of points on fractals, already we have been led to 
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Figure IV.95. Can you 
find the addresses of a, b, 
and c? 


Figure 1V.96. Ad- 
dresses at depth two in 
a quadtree. 
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Figure 1V.97. Can you 
describe the addresses of 
the points on the bound- 
ary of the central white 
region? 


try to compare “how many” points have a certain property to how many have an- 
other property. For example, in the case of the addressing scheme on the Sierpinski 
triangle described above, we wanted to compare the number of points with multiple 
addresses to the number of points with single addresses. It turns out that both num- 
bers are infinite. Yet still we want to compare their numbers. One way in which this 
may be done is through the concept of countability. 


Definition 1.1 Let S be a set. S is countable if it is empty or if there is an onto 
transformation c: I — S, where I is either one of the sets 


14,15, 23; (ies)... 2d, 3... eS 
or the positive integers {1, 2,3, 4, ...}. S is uncountable if it is not countable. 


We think of an uncountable set as being larger than a countable set. 
We are going to make fundamental use of code space to formalize the concept of 
addresses. How many points does code space contain? 


Theorem 1.1 Code space on two or more symbols is uncountable. 


Proof We prove it here for the code space on the two symbols {1,2}. Denote 
an element of code space £ by w = @1@203..., where each w; € {1, 2}. Define 


‘ft i Who lives 


around here ? NRA \ 
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p:{1, 2} — {1, 2} by o(1) = 2 and p(2) = 1. Suppose code space is countable. Let 
the counting function be c: {1, 2, 3,...} — 2%. Consider the point o € & defined by 


O = 010203..., 


where o;, = p((c(n)),), and (c(n)), means the nth symbol of c(n). When does the 
counting function reach o? Never! For example, c(3) 4 o because their third sym- 
bols are different! This completes the proof. 


Examples & Exercises 


1.10. The set of integers N = {0, +1, +2, ..., } is countable. Define c : N > N by 
c(z) = (z — 1)/2 if z is odd, c(z) = —z/2 if z is even. 


Figure [V.98. Attrac- 
tor of a hyperbolic IFS in 
the overlapping case. In 
the overlapping regions 
multiple addresses are 
available. 
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1.11. Prove that a countable set of countable sets is countable. Show that an un- 
countable set, take away a countable set, is uncountable. 


1.12. The rational numbers are countable. A rational number can be written in the 
form p/q, where p and g are integers with q # 0. Figure IV.99 shows how to count 
the positive ones, some numbers being counted more than once. Make a rule that 
gets rid of the redundant countings. Also, show how to include the negative rationals 
in the scheme. 


1.13. Show that a Sierpinski triangle contains countably many triangles. 


1.14. Let S be a perfect subset of a metric space. Suppose that S contains more than 
one point. Prove that S is uncountable. 


Tt: Characterize the addresses of the missing pieces in Figure IV.100. 


2 Continuous Transformations from Code Space to Fractals 


Definition 2.1 Let {X; w1, w2,..., wy} be a hyperbolic IFS. The code space as- 
sociated with the IFS, (2,dc), is defined to be the code space on N symbols 
{1,2,..., N}, with the metric dc given by 


(oe) 
Wn — On| 
doo, Ce Tae. fae for alla,ao € =. 
n=] 


Our goal is to construct a continuous transformation @ from the code space as- 
sociated with an IFS onto the attractor of the IFS. This will allow us to formalize 
our notion of addresses. In order to make this construction we will need two lem- 
mas. The first lemma tells us that if we have a hyperbolic IFS acting on a complete 
metric space, but we are only interested in studying how the IFS acts in relation to a 
fixed compact subset of X, then we can treat the IFS as though it were defined on a 
compact metric space. 

Lemma 2.1 Let {X; wz:n = 1,2,..., N} bea hyperbolic IFS, where (X, d) is a 
complete metric space. Let K € H(X). Then there exists K € H(X) such that K c K 
and Wn. K > K forn= 1,2,..., N. In other words, {K; wyatt 4, 273308, 7 
is a hyperbolic IFS where the underlying space is compact. 


Proof Define W : H(X) — H(X) by 
W(B)=U™_,w,(B) _ forall B € H(X). 
To construct K consider the IFS with condensation cS agT Rh a004e 2, «ic ; NY, 
where the condensation map wy is associated with the condensation set K. By The- 


orem 7.1 in Chapter III the attractor of this IFS belongs to 7{(X). By exercise 9.1 in 
Chapter III it can be written 


K = Closure of (K UW°!(K) UW(K) UW°(K) UU... UW"(K) U...... ). 
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It is readily seen that K C K and that W(K) C K. This completes the proof. 


The next lemma makes the first step in linking code space to IFS attractors, by 
introducing a certain transformation ¢, which maps the Cartesian product space 
x x N x X into X. By taking appropriate limits, in Theorem 2.1 below, we will 
eliminate the dependence on N and X to provide the desired connection between 
and X. 


Lemma 2.2 Let {X; w,:n = 1,2,...,.N} be a hyperbolic IFS of contractivity 
s, where (X,d) is a complete metric space. Let (X,dc) denote the code space 
associated with the IFS. For eacho € X,n € N, and x € X, define 


GlO,.N, &) S Wy, Olle, 0... We (). 


Let K denote a compact nonempty subset of X. Then there is a real constant D such 
that 


d(go(o, mM, x1), plo, n, X2)) = i aaa 
for allo € X,allm,n €N, and all x;, x2. € K. 


Proof Let o, m,n, x,, and x2 be as stated in the lemma. Construct K from K 
as in Lemma 2.1. Without any loss of generality we can suppose that m <n. Then 
observe that 


oo, n, x2) = oo, mM, g(a, n—m, x2)), 


where 


@ = On—m+1On—m+2---On-+. EB. Figure IV.99. How to 
count the positive rational 


Let x3 = (@, n — m, x2). Then x3 belongs to K. Hence we can write 
numbers. What is c(24)? 


d(o(o, m, x1), (9, n, X2)) = d(H(o, m, x1), PCG, m, x3)) 


<sd(W,, 0... 0 Wg, (X1), Wo, 0... 0 Wo, (X3)) 
2 

< s“d(Wo, 0... 0 We, (X1), Wo, 0... O Wy, (X3)) 

<.§d(31,%2)- 8D, 


where D = max{d (x1, x3) : X1, x3 € K }. D is finite because K is compact. This com- 
pletes the proof. 


Theorem 2.1 Let (X, d) be a complete metric space. Let {X; w,:n=1,2,..., 
N} be a hyperbolic IFS. Let A denote the attractor of the IFS. Let (X, dc) denote 
the code space associated with the IFS. For eacho € &,n € N, and x € X, let 


(0, n, X) = Wg, O Wo, 0... O We, (X). 
Then 


$(o) = lim $(0,n, x) 
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Figure IV.100. Char- =. 


acterize the addresses of an 
the missing pieces. aon, & >. 
LALA AE DS 2s, 
aft yi 
ganas a Poe. a, 
Bin em ‘Not wn te, Pe a 
Poe's WHAT ARE THE ADDRESSES 


fy >, dn 


wf 
~ SR ae : OF THE 
te Rake eo ho hs an A ak cw sae 


oh 
on San De, 
AD, , MISSING on oe if O..8h, 

Sete Se e238. é ‘, Cn adetak BMD, ap dAnd’. nbn, 
: ath y 

PIECES? ty, Pa es Ad, de te goin gaat, 

§ ey} 4 Pe Ao gets cE 

A: En tastn 2. “ LF Eo 5 < ere we Saath Es ines ee EN 


exists, belongs to A, and is independent of x € X. If K is a compact subset of X, 
then the convergence is uniform over x € K. The function ¢ : = — A thus provided 
is continuous and onto. 


Proof Let x €¢ X. Let K € H(X) be such that x € K. Construct K as in Lemma 
2.1. Define W : H(X) — 7(X) in the usual way. By Theorem 7.1 in Chapter III, W 
is a contraction mapping on the metric space (H((X), A(d)); and we have 


A= lim {We"(K)}. 


In particular {W°"(K)} is a Cauchy sequence in (H, h). Notice that @(o,n, x) € 
W°"(K). It follows from Theorem 7.1 in Chapter II that if lim,_,.. d(o, n, x) exists, 
then it belongs to A. 

That the latter limit does exist follows from the fact that, for fixed o € D, 
{p(o, n, x)}°2, is a Cauchy sequence: by Lemma 2.2 


d(o(o,m, x), d(o,n,x)) < Ds™” for all x € K, 


and the right-hand side here tends to zero as m and n tend to infinity. The uniformity 
of the convergence follows from the fact that the constant D is independent of 
xekK. 

Next we prove that @ : £ — A is continuous. Let ¢ > 0 be given. Choose n so 
that s”D < €, and let o, w € & obey 


1 
dc(o, w) < pie 2 5p ius a 
5 (N+1)™ (N+1)"*! 


Then one can verify that 0 must agree with w through n terms; that is, 0, = @), 0) = 
W2,.-.; On = Wn. It follows that for each m > n we can write 


d(o(o, mM, x), o(o, m, x)) = d(p(o, n, x1), g(o, n, X2)), 
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for some pair x, x2 € K. By Lemma 2.2 the right-hand side here is smaller than s"D 
which is smaller than €. Taking the limit as m — oo we find 


d(¢(o), 6(w)) <€. 
Finally, we prove that ¢ is onto. Let a € A. Then, since 
A= lim W°"({x}), 
noo 


it follows from Theorem 7.1 in Chapter II that there is a sequence {w”) € X:n= 
1,2,3,...} such that 


lim ¢(o”,n,x) =a. 
n— CoO 


Since (Z, dc) is compact, it follows that {w” :n = 1, 2,3, ...} possesses a conver- 
gent subsequence with limit w € D. Without loss of generality assume lim,-,. 0”) 
= w. Then the number of successive initial agreements between the components of 
w”) and w increases without limit. That is, if 


a(n) = number of elements in {j € N: ow” ==.@y for 1.< k.< J}, 
where N = {1, 2, 3,...}, then a(n) > co asn —> o. It follows that 
d(o(w,n, x), @(w@™, n, x)) < s*™™D. 


By taking the limit on both sides as n — oo we find d(¢(w), a) = 0, which implies 
g@(w) =a. Hence ¢ : X — A is onto. This completes the proof. 


Definition 2.2 Let {X; w,,n =1,2,3,...,.N} be a hyperbolic IFS with associ- 
ated code space &. Let: = — A be the continuous function from code space onto 
the attractor of the IFS constructed in Theorem I. An address of a point a € A is any 
member of the set 


o¢'(A) ={w€ D: ¢(@) =a}. 


This set is called the set of addresses of a € A. The IFS is said to be totally discon- 
nected if each point of its attractor possesses a unique address. The IFS is said to be 
just-touching if it is not totally disconnected yet its attractor contains an open set O 
such that 


(1) w(O)Nwj(O) = Ovi, j €{1,2,...,N} with i #7; 
(2) Fas lim;=; wj(O) C O. 


An IFS whose attractor obeys (i) and (ii) is said to obey the open set condition. 
The IFS is said to be overlapping if it is neither just-touching nor disconnected. 


Theorem 2.2 Let {X; w,,n =1,2,..., N} be a hyperbolic IFS with attractor 
A. The IFS is totally disconnected if and only if 


wi(A)Nw,(A)=OVi, jef{l,2,...,N)  withi¥;. (1) 
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Proof If the IFS is totally disconnected, then each point on its attractor possesses 
a unique address. This implies Equation 3. If the IFS is not totally disconnected, then 
some point on its attractor possesses two different addresses. These must disagree 
at some first place: choose inverse images to get this place out front, to produce a 
contradiction to Equation 3. This completes the proof. 


Examples & Exercises 


2.1. Show that the attractor of the IFS {R; 5x, 5x + 5] is just-touching. Classify 
the attractor for the IFS {R; 5x, 1}. 


2.2. Prove that the attractor of the IFS {R; $x, 3x a i} is overlapping. 


2.3. Consider the IFS {[0, 1], wa(x) = 45 + 7px. = 1,2, 3,..., 10} and for the 
associated code space use the symbols {0, 1, 2,...,9}. Show that the attractor of 
the IFS is [0, 1] and that it is just-touching. Identify the addresses of points with 
multiple addresses. Show that the address of a point is just its decimal representation. 
Comment on the fact that some numbers have two decimal representations. 


2.4. Prove that the attractor to the IFS {[0, 1]; wi(x) = 5X, w7(x) = 5x + =) is 
totally disconnected. 


2.5. Prove that the IFS that generates the Black Spleenwort fern, given in Chapter 2, 
iS just-touching. 


2.6. Show that the IFS {[0, 1]; w;(x) = $, w(x) = 5} is overlapping. 


We need to understand the structure of code space. Theorem 2.1 told us that the 
code space on N symbols is the mother of all hyperbolic IFS consisting of N maps. 
We will use the following theorem to show that the mother is metrically equivalent 
to a classical Cantor set. 


Theorem 2.3 Let = denote the code space of the N symbols, {1,2,..., N}, and 
define two different metrics on & by 


lo @) 
x; Se yi| i 
dix, w= a dy(x, 
1(x, y) wy a(x, y) = al 
Then (X, d,) and (X, d>) are equivalent metric spaces. 


Proof We give the proof for the case N = 10. Let x, y € & be given. Clearly 
we have d>(x, y) < d;(x, y). We must show that there is a constant C so that 
Cd(x, y) < do(x, y), where C is independent of x and y. Here we pick C = 3 
and show that it works. 

We can suppose that for some k € {1,2,3,...}, X1 = yi, X2 = yo,..-, Xe-1 = 
Yk-1, Xk F Ye. Then 
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Figure IV.101. Noth- 
ing lands here. 
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x x =. [xi — Yi 
(Pea met Rew a tp ie ily 


= = 79 11! rae 11! 
1 = |x; ae {di 
27192. ii ge: y). 


This completes the proof. 


We now show that code space is metrically equivalent to a totally disconnected 
Cantor subset of [0, 1]. Define a hyperbolic IFS by {[0, 1]; w,(x) = Went +x: 
n= 1.2... Ni ts 

n n+1 
wall, IDS Wd 
as illustrated for N = 3 in Figure IV.101. 

The attractor for this IFS is totally disconnected, as illustrated in Figure IV.102 
for = 3. 

In the case N = 3, the attractor is contained in [3, 1]. The fixed points of the three 
transformations w,(x) = ix + i, w(x) = ix + s w2(x) = ix + ; are 7 z, and 1, 
respectively. Moreover, the address of any point on the attractor is exactly the same 
as the string of digits that represents it in base N + 1. What is happening here is 
this. At level zero we begin with all numbers in [0, 1] represented in base (N + 1). 
We remove all those points whose first digit is 0. For example, in the case N = 3 
this eliminates the interval [0, ak At the second level we remove from the remaining 
points all those that have digit 0 in the second place. And so on. We end up with 


] lon 17g 
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Figure IV.102. A spe- 
cial ternary Cantor set in 
the making. 


w, (4.1) w_({}.1)) w,((4.1)) 


those numbers whose expansion in base (N + 1) does not contain the digit 0. Now 
consider the continuous transformation @ : (2, dc) — (A, Euclidean). It follows 
from Theorem 2.3 that the two metric spaces are equivalent. @ is the transformation 
that provides the equivalence. Thus, we have a realization, a way of picturing code 
space. 


Examples & Exercises 

2.7. Find the figure analogous to Figure IV.102, corresponding to the case N = 9. 
2.8. What is the smallest number in [0, 1] whose decimal expansion contains no 
zeros? 


We continue to discuss the relationship between the attractor A of a hyperbolic 
IFS {X; w), w2,..., wy} and its associated code space &. Let ¢: & — X be the 
code space map constructed in Theorem 2.1. Let w = @1@2W3w,4... be an address of 
a point x € A. Then 


@ = JWG W304... 

is an address of w(x), foreach j € {1,2,..., N}. 

Definition 2.3 Let {X, w;, w2,..., wy} be a hyperbolic IFS with attractor A. 
A point a € A is called a periodic point of the IFS if there is a finite sequence of 
numbers {a (n) € {1,2,..., N}}7_, such that 

a = Wo(P) 0 Wo(P-1) 0... 0 We(1)(@). (2) 

Ifa € A is periodic, then the smallest integer P such that the latter statement is true 
is called the period of a. 


Thus, a point on an attractor is periodic if we can apply a sequence of w,,’s to it, 
in such a way as to get back to exactly the same point after finitely many steps. Let 
a €A be a periodic point that obeys (2). Let o be the point in the associated code 
space, defined by 

o =o0(P)o(P —1)...0(l)o(P)o(P —1)...0(l)o(P)o(P — 1)... 
=o(P)o(P —1)...oa(1). (3) 


2 Continuous Transformations from Code Space to Fractals 


Then, by considering lim,-... ¢(0, n, a), we see that P(o) =a. 


Definition 2.4 A point in code space whose symbols are periodic, as in (3), is 
called a periodic address. A point in code space whose symbols are periodic after a 
finite initial set is omitted is called eventually periodic. 


Examples & Exercises 

2.9. An example of a periodic address is 
1212121212121212121212121212121212121212121212121212121212..., 

where 12 is repeated endlessly. An example of an eventually periodic address is: 
1121111112111121111211112122121121212121212121212121212121..., 

where 21 is repeated endlessly. 


2.10. Prove the following theorem: “Let {X; w), w2,..., wy} be a hyperbolic IFS 
with attractor A. Then the following statements are equivalent: 
(1) x € A is a periodic point; 
(2) x € A possesses a periodic address; 
(3) x € A is a fixed point of an element of the semigroup of transformations 
generated by {w , w2,..., wy}.” 


2.11. Show that a point x € [0, 1] is a periodic point of the IFS 
+] 1 
’ 1 ; Sky eS =e 
{[0, 1] ee al 5} 
if and only if it can be written x = p/ (2% — 1) for some integer 0 < p < 2% — 1 and 


some integer N ¢€ {1, 2, 3,...}. 

2.12. Let {X; w}, w2,..., wn} denote a hyperbolic IFS with attractor A. Define 

W(S) = Le Ws (S) when S is a subset of X. Let P denote the set of periodic points 

of the IFS. Show that W(P) = P. 

2.13. Locate all the periodic points of period 3 for the IFS {R?; $z,5z+5,35z+ 

5}. Mark the positions of these points on A. 

2.14. Locate all periodic points of the IFS {R; w(x) = 0, w2(x) = 5x + 5}. 
Theorem 2.4 The attractor of an IFS is the closure of its periodic points. 
Proof Code space is the closure of the set of periodic codes. Lift this statement to 

A using the code space map ¢: & — A. (@ is a continuous mapping from a metric 


space & onto a metric space A. If S C & is such that its closure equals &, then the 
closure of f(S) equals A.) 


Examples & Exercises 


2.15. Prove that the attractor of a totally disconnected hyperbolic IFS of two or 
more maps is uncountable. 
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2.16. Under what conditions does the attractor of a hyperbolic IFS contain uncount- 
ably many points with multiple addresses? Do not try to give a complete answer; just 
some conditions: think about the problem. 


2.17. Under what conditions do there exist points in the attractor of a hyperbolic 
IFS with uncountably many addresses? As in 2.16, do not try to give a full answer. 


2.18. In the standard construction of the classical Cantor set C, described in exer- 
cise 1.5 in Chapter III, a succession of open subintervals of [0, 1] is removed. The 
endpoints of each of these intervals belong to C. Show that the set of such interval 
endpoints is countable. Show that C itself is uncountable. C is the attractor of the IFS 
10; 1i- 5X, zx te =}. Characterize the addresses of the set of interval endpoints in C. 


3 Introduction to Dynamical Systems 


We introduce the idea of a dynamical system and some of the associated terminol- 
ogy. 

Definition 3.1 A dynamical system is a transformation f :X — X ona metric 
space (X, d). It is denoted by {X; f}. The orbit of a point x € X is the sequence 
Fae ()j no" 

As we will discover, dynamical systems are sources of deterministic fractals. The 
reasons for this are deeply intertwined with IFS theory, as we will see. Later we 
will introduce a special type of dynamical system, called a shift dynamical system, 
which can be associated with an IFS. By studying the orbits of these systems we 
will learn more about fractals. One of our goals is to learn why the Random Iteration 
Algorithm, used in Program 2 in Chapter III, successfully calculates the images of 
attractors of IFS. More information about the deep structure of attractors of IFS will 
be discovered. 


Examples & Exercises 
3.1. Define a function on code space, f : 2 > &, by 
ft (XyHoXaXe bs.) SH XONAN4XS.s 
Then {2; f} 1s a dynamical system. 
3.2. {[0, 1]; f(x) =Ax(1 — x)} is a dynamical system for each A € [0, 4]. We say 
that we have a one-parameter family of dynamical systems. 


3.3. Let w(x) = Ax + ¢ be an affine transformation in R?. Then {R2; w} is a dy- 
namical system. 
3.4. Define T : C[0, 1] > C[0, 1] by 

1 


J. 1 1 
(Tf)(x) = 545%) + aI 5% + 5): 
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Figure IV.103. An 
example of a “stretch, 
squeeze, and bend” dy- 
namical system (Smale 
horseshoe function). 


Start 


Finish. Put the deformed 
space back on itself. 


Then {C[0, 1]; 7} is a dynamical system. 


3.5. Let w:C > C be a Mobius transformation. That is w(z) = (az + b)/(cz +d), 
where a, b, c, d € C, and (ad — bc) £0. Then {C; w(z)} is a dynamical system. 


3.6. {[0, 1]; 2xmod1} is a dynamical system. Here 2xmod1 = 2x — [2x], where 
[2x] denotes the greatest integer less than or equal to 2x. 


3.7. Define a transformation f : m= — mas illustrated in Figure IV.103. {m; f} is a 
dynamical system. | 


In dynamical systems theory one is interested in what happens when one follows a 
typical orbit: is there some kind of attractor that usually occurs? Dynamical systems 
become interesting when the transformations involved are not contraction mappings, 
so that a single transformation suffices to produce interesting behavior. The orbit of 
a single point may be a geometrically complex set. Some thought about horizontal 
slices through Figure IV.104 will quickly suggest to the inquisitive student that there 
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Figure IV.104. One 
million iterations of a 
small black square in a 
“stretch, squeeze, and 


bend” dynamical system. 


Can you find a relation- 
ship to IFS theory? 


is a Close relationship between this noncontractive dynamical system and a hyper- 
bolic IFS. 

Definition 3.2 Let {X; f} be a dynamical system. A periodic point of f is a point 
x € X such that f°"(x) =x for some n € {1, 2,3, ...}. If x is a periodic point of f; 
then an integer n such that f°"(x) =x,n € {1, 2,3, ...} is called a period of x. The 
least such integer is called the minimal period of the periodic point x. The orbit of 
a periodic point of f is called a cycle of f. The minimal period of a cycle is the 
number of distinct points it contains. A period of a cycle of f is a period of a point 
in the cycle. 


Definition 3.3 Let {X; f} be a dynamical system and let x  € X be a fixed point 
of f. The point x; is called an attractive fixed point of f if there is a number € > 0. so 
that f maps the ball B(x, €) into itself, and moreover f is a contraction mapping on 
B(xs, €). Here B(xs,€) ={y EX: d(x, y) <€}. The point x is called a repulsive 
fixed point of f if there are numbers € > 0 and C > 1 such that 


a(f (xs), f(y)) = Cd(xz, y) for all y € B(xs, €). 


A periodic point of f of period n is attractive if it is an attractive fixed point of 
f°". A cycle of period n is an attractive cycle of f if the cycle contains an attractive 
periodic point of f of period n. A periodic point of f of period n is repulsive if it 


3 Introduction to Dynamical Systems 


REPULSIVE FIXED POINT Figure 1V.105. The 


ON THE 
dynamics of a simple 
SPHERE Mobius transformation. 
Points spiral away from 
one fixed point and spiral 
in toward the other. What 
happens if the fixed points 
coincide? 
ATTRACTIVE FIXED POINT 
Se 
IN THE 
PLANE (¢ @) aN 
ee 


is a repulsive fixed point of f°”. A cycle of period n is a repulsive cycle of f if the 
cycle contains a repulsive periodic point of f of period n. 


Definition 3.4 Let {X, f} be a dynamical system. A point x € X is called an 
eventually periodic point of f if f°" (x) is periodic for some positve integer m. 


Remark: The definitions given here for attractive and repulsive points are con- 
sistent with the definitions we use for metric equivalence and will be used through- 
out the text. The definitions used in dynamical systems theory are usually more topo- 
logical in nature. These are given later in exercises 5.4 and 5.5. 


Examples & Exercises 


3.8. The point x = 0 is an attractive fixed point for the dynamical system {R; $ x}, 
and a repulsive fixed point for the dynamical system {R; 2x}. 


3.9. The point z = 0 is an attractive fixed point, and z = oo is a repulsive fixed 
point, for the dynamical system 


(C; (cos 10° + i sin 10°)(0.9)z}. 
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Figure IV.106. Points 

belonging to an orbit of a 
Mobius transformation on 
a sphere. 


Figure 1V.107. This 
shows an example of 
a web diagram. A web 
diagram is a means for 
displaying and analyzing Y 
the orbit of a point x» ¢ R 
for a dynamical system 
(R, f). The geometrical 
construction of a web 
diagram makes use of the 


graph of f(x). 


A typical orbit, starting from near the point of infinity on the sphere, is shown in 
Figures IV.105 and IV.106. 


3.10. The point x, = 111111 is a repulsive fixed point for the dynamical system 
{y; f} where f : & — & is defined by 


f (x1 X2x3X4X5 <* ) = XIX3X4X5.... 
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Show that x = 121212 is a repulsive fixed point of period 2, and that {1212, 2121} is 
a repulsive cycle of period 2. 


3.11. The dynamical system {[0, 1]; 5x(1 — x)} possesses the attractive fixed point 
x = 0. Can you find a repulsive fixed point for this system? 


There is a delightful construction for representing orbits of a dynamical system 
of the special form {R; f(x)}. It utilizes the graph of the function f : R + R. We 
describe here how it is used to represent the orbit {x, = f°"(xo)}°2, of a point 
xo ER. 2 

For simplicity we suppose that f : [0, 1] — [0, 1]. Draw the square {(x, y):0< 
x < 1,0 < y < 1} and sketch the graphs of y= f(x) and y =x for x € [0, 1]. 
Start at the point (xo, xo) and connect it by a straight-line segment to the point 
(xo, x1 = f(Xo)). Connect this point by a straight-line segment to the point (x1, x;). 
Connect this point by a straight-line segment to the point (x,, x2 = f(x;)); and con- 
tinue. The orbit itself shows up on the 45° line y = x, as the sequence of points 
(Xo, Xo), (X1, X1), (X2, X2), .... We call the result of this geometrical construction a 
web diagram. 

It is straightforward to write computergraphical routines that plot web diagrams 
on the graphics display device of a microcomputer. The following program is written 
in BASIC. It runs without modification on an IBM PC with Color Graphics Adaptor 
and Turbobasic. On any line the words preceded by a ’ are comments: they are not 
part of the program. 


Program 1. 
1=3.79 : xn=0.95 *parameter value 3.79, orbit starts 
at 0.95 
def fnf(xn)=l*xn*(1-xn) ’change this function f(x) for other 
dynamical systems 


screen 1: cls ?initialize computer graphics 

window (0,0)-(1,1) ’set plotting window to 0 <x <i, 
0<¥¥<4 

for k=1 to 400 *plot the graph of the f(x) 

pset (k/400, fnf(k/400)) 

next k 

do >the main computational loop 

n=n+1 increment the counter, $n$ 

y=fnf (xn) >compute the next point on the orbit 

line (xn,xn)-(xn,y), n ’draw a line from (xn,xn) to (xn,y) 
in color n 

line (xn,y)-(y,y), n ’draw a line segment from (xn,y) to 
(y,y) in color n 

xn=y *’set xn to be the most recently computed 


point on the orbit 


loop until instat : end ’stop running if a key is pressed. 
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Two examples of some web diagrams computed using this program are shown in 
Figure IV.108. The dynamical system used in this case is {[0, 1]; f(x) =3.79x(1 — 


x)}. | 
Examples & Exercises 


3.12. Rewrite Program 1 in a form suitable for your own computer environment. 
Use the resulting system to study the dynamical systems {[0, 1]; Ax(1 — x)} forA = 
0.55, 1.3, 2.225, 3.014, 3.794. Try to classify the various species of web diagrams 
that occur for this one-parameter family of dynamical systems. 


3.13. Divide [0, 1] into 16 subintervals (0, 7), [75. 7), ---. [43, 2). (2, Ui. Let 
f : [0,1] — [0, 1] be defined by f(x) =Ax(1 — x), where A ¢€ [0, 4] is a parame- 
ter. Compute {f°"(5) :n =0, 1, 2,..., 5000} and keep track of the frequency with 
which f°") falls in the kth interval for k = 1, 2, 4, 8, 16, and A = 0.55, 1.3, 2.225, 
3.014, 3.794. Make histograms of your results. 


3.14. Describe the behavior for the one-parameter family of dynamical system 
s{R U {co}; Ax}, where A is a real parameter, in the cases (1) A = 0; (ii) O < |A| < 1; 
(iii) A = —1; (iv) A= 1; (v) 1 <A < ©0. 


3.15. Analyze possible behaviors of {R?; Ax +t}, where Ax + t is an affine trans- 
formation. 


3.16. Study possible behaviors of orbits for the dynamical system {C:;Mébius 
transformation}. You should make appropriate changes of coordinates to simplify 
the discussion. 


3.17. Show that all points are eventually periodic for the slide-and-fold dynamical 
system {R; f}, where 

x+1 if x < 0, 

eed PS if x >0. 


This system is illustrated in Figure IV.109. 


3.18. Let {X; w;, w2,..., wy} be a hyperbolic IFS. Then {71(X); W} is a dynami- 
cal system, where 


W(B) = U™_,w,(B) for all B € H(X). 


Dynamical systems that act on sets in place of points are sometimes called set 
dynamical systems. Show that the attractor of the IFS is an attractive fixed point of 
the dynamical system {7{((X); W}. You should quote appropriate results from earlier 
theorems. 


3.19. We consider again our two-dimensional code space, having both past and 
future, called the space of shifts (see exercise 1.12 in Chapter II). In this space, the 
operation of the shift transformation is a homeomorphism of the space to itself (it is 
frequently called the shift automorphism). There is a very geometrical interpretation 
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Figure 1V.108. Two 
examples of web di- 
agrams computed us- 
ing Program 1. The dy- 
namical system in this 
case is {[0, 1]; f(x) = 
Ax(1 — x)}, for two dif- 
ferent values of A € (0, 4). 
The system corresponding 
to the lower value of A is 
orderly; the other is close 
to being chaotic. 
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Figure IV.109. An or- 
bit of the “slide-and-fold” 
dynamical system de- 
scribed in example 3.17. 
Can you prove that all 
orbits are eventually peri- 
odic? 


of the shift automorphism here. We arrive at this by looking at the action of the shift 
transformation with the metric d; with k = N, as in exercise 2.19, Chapter II. To 
simplify the discussion, assume N = 2. The space of shifts is a two dimensional 
code space with points 


Ce ahora... 5 VINES «) 


on which we put the “Euclidean” metric (see exercise 2.6 in Chapter II), 


d((x1, ¥1), (X2, ¥2)) = (S25 si) +(P5% x). 


i=] 


The shift transformation here is best described by writing 
(x, y) =... Y3V2Y1-X1X2X3---- 
We now shift by moving the dot one place to the right, to get 
T(x, ¥) =. «. Yovi di RiXaXa.. - = (09%3 «.. XE 90 -~-). 


With the metric just mentioned, we can relate it to the square [0, 1] x [0, 1]. Each 
point in this square has a binary expansion in terms of ones and zeros, so that a 
point (x, y) can be written (.x)x2...,.y1y2...), with precisely the same symbols 
and metric (Euclidean). The shift operation can now be seen as doing the following: 


stretch x: double x. This shifts the first digit up so that it is in the ones place. 
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Figure IV.110. A sign 
of things to come. 


squeeze y: halve y. This shifts a zero into the first digit and shifts all other digits 
down. 

raise half the interval: If the digit now in the ones place for x is a 1, replace the 
new 0 first digit of y with a 1. This adds a half to y. 

put it on top: If the digit in the ones place for x is a 1, discard it. This brings the 


x values to those between 0 and 1, so this half of the points is put above the 
other half. 


What we have done is stretch the square to twice its width (double x) and half its 
height (halve y), cut the rectangle into two pieces at x = 1, and put the right half on 
top of the bottom half (add 1/2 to y if the new x is greater than 1). This operation 
of stretching out the square, cutting it, and stacking the pieces is called a baker's 
transformation, because it resembles a baker rolling, cutting, and stacking dough (to 
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make pastry, for example). It is identical to the shift transformation on the space of 
shifts so long as the dough remains in distinct layers (unlike R*, 01 10). 


Definition 3.5 This transformation is famous because it is the heart of any 
invertible “mixing” function (one has to allow any number of cuts and for uneven 
rolling). A mixing function is a function f such that given any set A (from some 
class of sets; here let’s say sets with an interior), and any other set B from the same 
class, there is an N such that f"(A)N B £@ for anyn>N. 


The term mixing is appropriate: if A is red and B is blue, then eventually they 
are both somewhat purple (have both red and blue in them). A nice property of this 
mixing business is that there is at least one point in the space such that {f"(x) :n = 
1,2,...} is dense, that is, given an open set O, there is an n such that f”(x) € O. 
When f has this property, we say that it has a dense orbit. T is mixing on the space 
of shifts and on code space, and it has a dense orbit as a result. 


Examples & Exercises 


3.20. Prove that for any code space £ on N symbols, there is a point o € &, 
such that o has a dense orbit under the shift transformation, that is {7"(0) :n = 
1,2, 3, ..-} dense in 2. 


3.21. Show that T is mixing on code space for the class of open sets. 


4 Dynamics on Fractals: Or How to Compute Orbits by Looking at Pictures 


We continue with the main theme for this chapter, namely dynamical systems on 
fractals. We will need the following result. 

Lemma 4.1 Let {X; w,,n =1,2,..., N} bea hyperbolic IFS with attractor A. 
If the IFS is totally disconnected, then for eachn € {1,2,..., N}, the transformation 
w,: A — A Is one-to-one. 

Proof We use a code space argument. Suppose that there is an integer n € 
{1,2,..., MN} and distinct points a,, a2 € A so that w,(a)) = wp(a2) =a € A. If ay 
has address w and a> has address o, then a has the two addresses nw and no. This is 
impossible because A is totally disconnected. This completes the proof. 


Lemma 4.1 shows that the following definition is good. 

Definition 4.1 Let {X; w,,n =1,2,...,.N} be a totally disconnected hyper- 
bolic IFS with attractor A. The associated shift transformation on A is the trans- 
formation § : A — A defined by 

Sa)=w7'(a) ‘fora € w,(A), 


where w, is viewed as a transformation on A. The dynamical system {A; S} is called 
the shift dynamical system associated with the IFS. 
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Examples & Exercises 
4.1. Figure IV.111 shows the attractor of the IFS 


[mi047({*).047( 2) +(o)-047(2) +(0)}. 


Figure [V.111 also shows an eventually periodic orbit {a, = S°"(ap) }re 9 for the 
associated shift dynamical system. This orbit actually ends up at the fixed point 
(2222). The orbit reads ap = (13132222), a, = (31312222), ap = (132222), 
a3 = (32222), a4 = (2222), where ¢ : & — A is the associated code space map. 
a4 € A is Clearly a repulsive fixed point of the dynamical system. Notice how one 
can read off the orbit of the point ap from its address. Start from another point very 
close to do and see what happens. Notice how the dynamics depend not only on A 
itself, but also on the IFS. A different IFS with the same attractor will in general lead 


to different shift dynamics. 


4.2. Both Figures IV.112 and IV.113 show attractors of IFS’s. In each case the 
implied IFS is the obvious one. Give the addresses of the points {a, = S™” (ao) }P° 6 


of the eventually periodic orbit in Figure IV.112. Show that the cycle to which the 


Figure IV.111. An 
orbit of a shift dynamical 
system on a fractal. 
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orbit converges is a repulsive cycle of period 3. The orbit in Figure [V.113 is either 
very long or infinitely long: why is it hard for us to know which? 


4.3. Figure IV.114 shows an orbit of a point under the shift dynamical system 
associated with a certain IFS {R?; w;, w2, w3}, where w;, w2, and w3 are affine 
transformations. Deduce the orbits of the points marked b and c in the figure. 


4.4. Figure IV.115 shows the start of an orbit of a point under the shift dy- 
namical system associated with a certain hyperbolic IFS. The IFS is of the form 
{IR; w;, w2, w3}, where the transformations w, : R — R are affine and the attractor 
is [0, 1]. Sketch part of the orbit of the point labelled b in the figure. (Notice that this 
IFS is actually just-touching: nonetheless it is straightforward to define uniquely the 
associated shift dynamics on O M A where OC is the open set referred to in Definition 
2.2) 


We can sharpen up the definition of the overlapping IFS with the aid of the mixing 
properties discussed in section 3. Let {X; wi,..., wy} be a hyperbolic IFS, and 
define the set 


M =| J(wi(A) 0 wj(A)) 
iFj 
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Figure IV.113. A 
chaotic orbit getting 
started. The shift dy- 
namics are often wild. 
Why? 


of points in various intersections of the maps of the IFS. Then the following proper- 
ties hold: 


open interior: If there is a set O, open with respect to A, such that O Cc M, 
then the IFS is overlapping. This allows the IFS to be declared overlapping 
easily in some cases. The proof is not too difficult: Suppose this to be the 
case, namely that M contains an open set O. Suppose that O, were an open 
set that we thought might satisfy the open set condition for just-touching IFS. 
Then W"(O,) N O = @ for all n, since ©; can’t contain points in the overlap, 
and maps inside itself. Using the continuous map ¢@ : & — A, we know that 
we would then have ¢~'(©,) and ¢~!(©) both open sets in code space. But 
T"(¢—!(O)) must intersect ¢~!(O,) in code space for some n, due to mixing, 
and an address in the intersection corresponds to a point a on the attractor 
such that W"({a} thus intersects O. Hence ©, cannot exist, and the IFS is 
overlapping. 

dense address: Notice that in order to prevent the IFS from being just-touching 
in the proof just given, the orbit of ¢~'!(M) only needs to be dense in © 
to end up with points in the image of any open set in A. Consequently, 
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Figure IV.114. The ie 

orbit of the point a is A A 

shown. Can you plot A b 

the first few points of A A A 

the orbits of b and c? 

Warning! The IFS here is "Y: VW. 

not the usual one. See how hy’ A ie AD 

the knowledge of some A fe & rw 
dynamics can imply some A, A rN PNA, A 4 A 


more! 


Figure 1V.115. This 
figure shows a sketch of 
part of an orbit of an 
IFS {[0, 1]; wi, we, ws} 
on its attractor [0, 1]. 
The transformation w, : 
(0, 1] — [0, 1] is affine for 
i = 1,2, 3. Sketch part of 
the orbit of b. 


an IFS is overlapping if the orbit of ¢~'(M) in code space under the shift 
transformation is dense. 


empty interiors: If the IFS is made up of affine maps, and it “looks” just- 
touching, that is, M does not have an interior, then it is just-touching. The 
important property of affine maps used here is that they map boundary points 
to boundary points, and boundary points come from boundary points. 


4.5. The empty interiors property is not general, but is useful when it applies. To 
see why it is restricted, consider the following IFS, made of six translated copies of 
the following map: 

On the interval [—1, 1], let (x) = Arccos(x), that is, map x to the point directly 
above it on the unit circle. Then take 9(x) to the point 0(x) — asin @(x), where 
a € [0, 1/2). Then map the new point on the circle back to [—1, 1] by taking 0(x) 
to x’ = cos 6(x). Now fold the interval over at 0 with the map x’, and ensure that it 
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is contractive by dividing by 3. The interval has now been mapped to [0, 1/3]. Call 
this map v(x). Explicitly, we have 


1 
vit} 3 cos”(Arccos(x) — sin(Arccos(x)). 
We now form 6 maps w}, ..., we by translations and inversions of v(x): 


w1(x) = v(x) — 1 w(x) = —v(x) — : 
w3(x) = v(x) — : wa(x) = —v(x) + : 


] 
ws(x) = v(x) + 3 We(x) = —v(x) + 1. 
The reader should be able to verify that the attractor of the IFS 
{[-1, 1]; wi, we, w3, w4, Ws, We} 


is the interval [—1, 1], and that each of these maps touches any neighbor at a single 
point. There is a point xo(@) such that 


Arccos(x9) — @ sin Arccos(x9) = 2/2, 
whose image is the points 
{—1, —1/3, 1/3, 1}, 


and whose endpoints map to {—2/3, 0, 2/3}. By choosing a at different values we 
can move Xo around the interval [—1/3, 0]. If we pick o € & to be a dense orbit in 
x under the shift transformation, we can successively approximate this address for 
Xo such that the address of xo is 30, and we can do this for each such o € X&. We can 
also do this for a variety of periodic orbits, which are not dense. 

It turns out that for most values of xp € [—1/3, 0] (the probability of a value in this 
interval being one of these is 100%), this IFS is overlapping, although between every 
two values for which it is overlapping, there is a value for which it is just-touching. 
The attractors of this family of IFS are identical, as are the intersection points. 
This is thus both an example of an IFS that has a finite set of intersection points 
and is (sometimes) overlapping, and an example of one that does not go smoothly 
through the succession from totally disconnected to just-touching to overlapping. 
Small wonder these properties are defined for the IFS and not the attractor; they are 
really properties governing the behavior of addresses in code space. 


5 Equivalent Dynamical Systems 


Definition 5.1 Two metric spaces (Xi, d\) and (X2, dz) are said to be topologically 
equivalent if there is a homeomorphism f: X, — X. Two subsets S; C X; and S82 C 
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Figure 1V.116. Acom- 
mutative diagram that es- 
tablishes the equivalence 
between the two dynami- 
cal systems {X,; f,} and 
{X.; fo}. The function 

h : X, — X)> is a home- 
omorphism. 


X> are topologically equivalent, or homeomorphic, if the metric spaces (S,, d,) and 
(S2, d2) are topologically equivalent. S,; and Sz are metrically equivalent if (S,, d;) 
and (Sp, dz) are equivalent metric spaces. 


The Cantor set and code space, discussed following Theorem 2.3 in Chapter IV, 
are metrically equivalent. Theorem 8.5 in Chapter II tells us that if f : X; — Xp is 
a continuous one-to-one mapping from a compact metric (X;, d,) onto a compact 
metric space (X2, d2) then f is a homeomorphism. So by means of the code space 
mapping @: & — A (Theorem 2.1) one readily establishes that the attractor of a 
totally disconnected hyperbolic IFS is topologically equivalent to a classical Cantor 
set. 

Topological equivalence permits a great deal more “stretching” and “compres- 
sion” to take place than is permitted by metric equivalence. Later we will define a 
quantity called the fractal dimension. The fractal dimension of a subset of a metric 
space such as (IR, Euclidean) provides a measure of the geometrical complexity of 
the set; it measures the wildness of the set, and it may be used to predict your ex- 
citement and wonder when you look at a picture of the set. We will show that two 
metrically equivalent sets have the same fractal dimension. If they are merely topo- 
logically equivalent, their fractal dimensions may be different. 

With the naturally implied metrics, [0, 1] is homeomorphic to [0, 2]. m is homeo- 


is 


‘“ 
morphic to@. What is more, __ is even homeomorphic oe , and 


homeomorphic to AOA, 

In fractal geometry we are especially interested in the geometry of sets, and in the 
way they /ook, when they are represented by pictures. Thus we use the restrictive 
condition of metric equivalence to start to define mathematically what we mean 
when we say that two sets are alike. However, in dynamical systems theory we are 
interested in motion itself, in the dynamics, in the way points move, in the existence 
of periodic orbits, in the asymptotic behavior of orbits, and so on. These structures 
are not damaged by homeomorphisms, as we will see, and hence we say that two 
dynamical systems are alike if they are related via a homeomorphism. 


Definition 5.2 Two dynamical systems {X,; f;} and {X2; fo} are said to be 
equivalent, or topologically conjugate, if there is a homeomorphism 0 : X; — X» 
such that 

Fi(x)) = go! fo) hr fo) 8(x1) for all x; € Xj, 


fo(x2) =O 0 f, 067' (x2) for all x2 € X>. 


In other words, the two dynamical systems are related by the commutative diagram 
shown in Figure IV.116. 


The following theorem expresses formally what should already be clear intu- 
itively from our experience with shift dynamics on fractals. 
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Theorem 5.1 Let {X; wi, w2, ..., wy} be a totally disconnected hyperbolic IFS 
and let {A; S} be the associated shift dynamical system. Let X& be the associated code 
space of N symbols and let T : X — & be defined by 


T (010203...) = 0720304... for allo = 0)0203...€ D. 


Then the two dynamical systems {A; S} and {X; T} are equivalent. The homeomor- 
phism that provides this equivalence is ¢: & — A, as defined in Theorem 4.2.1. 
Moreover, {a1,a2,...,@p} is a repulsive cycle of period p for S if, and only if, 
{$ (a1), (a2), ..., $(ap)} is a repulsive cycle of period p for T. 


Examples & Exercises 


§.1. Let {Xi; f,} and {X2; f)} be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by 0 : X; — X>2. Show that 


{x1,%2,---,Xp} 
is a cycle of period p for {X;; /,} if and only if 
{6(x1), 8 (x2), eS O(xp)} 


is acycle of period p for {X2; f2}. Suppose that {x;, x2, ..., Xp} is an attractive cycle 
for f,. Show that this does not imply that {6(x;), ..., 9(x p)} is an attractive cycle for 
hr. 


5.2. Let {X1; f;} and {X2; f2} be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by 6 : X; —> Xp. Let { f7"(x)}P2, be 
an eventually periodic orbit of f,. Show that { f7"(A(x))}°2o is an eventually peri- 
odic orbit of fo. 


5.3. Let {X1; fi} and {X2; fo} be equivalent dynamical systems. Let a homeomor- 
phism that provides this equivalence be denoted by @ : X; — Xp». Let this homeo- 
morphism be such as to make the two spaces (X;, d;) and (X», d2) metrically equiva- 
lent. Construct an example where x ¢ € X; is a repulsive fixed point of the dynamical 
system {X,, f:} yet 9(x;) is not a repulsive fixed point of {X>, d}. 


5.4. Let {X; fi} and {X2; f2} be equivalent metric spaces. Let a homeomorphism 
that provides their equivalence be denoted by 6 : X; — Xp». Let x¢ € X; be a fixed 
point of f; . Suppose there is an open set © that contains x; and is such that x € O 
implies lim,+oo f°"(x) = x. Show that there is an open neighborhood of 0(x +) in 
X>» with a similar property. 


§.5. Our definition of attractive and repulsive fixed points and cycles, Definition 
3.4, has the feature that it depends heavily on the metric. It is motivated by the 
situation of analytic dynamics where small disks are almost mapped into disks. 
Show how one can use exercise 5.4 to make a definition of an attractive cycle in 
such a way that attractiveness of cycles is preserved under topological conjugacy. 
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Figure IV.117. Attrac- 

tive and repulsive fixed ? | 
points in a web diagram 

for a differentiable dy- y=x 
namical system. Analyze 


the ? points. my, 
ATTRACTIVE 
FIXED POINT 
|F*(X)| <1 


REPULSIVE } 
FIXED POINT 
IF*(X)| >1 

Y=f(X) 
a 


5.6. Let A C R. Then a function f : A > A is differentiable at a point xq € A if 
fon | 


x—>Xg x — XxX 
xeéA 0 


ee ee ee ee 


exists. If this limit exists it is denoted by f’(xo). Let {R; w), w2,..., wy} be a to- 
tally disconnected hyperbolic IFS acting on the metric space (IR, Euclidean). Sup- 
pose that, for each n = 1,2,..., N, wn(x) is differentiable, with |w’,(x)| > 0 for all 
x € R. Show that the associated shift dynamical system {A; S} is such that S is dif- 
ferentiable at each point xo € A and, moreover, |S’(xo)| > 1 for all x € A. 


§.7. Let {R; f} and {R; g} be equivalent dynamical systems. Let a homeomorphism 
that provides their equivalence be denoted by 0 : R > R. If 6(x) is differentiable for 
all x € R, then the dynamical systems are said to be diffeomorphic. Prove that aj is 
an attractive fixed point of f if and only if 0(a;) is an attractive fixed point of g. 


5.8. Let {R; f} be a dynamical system such that f is differentiable for all x € R. 
Consider the web diagrams associated with this system. Show that the fixed points 
of f are exactly the intersections of the line y = x with the graph y = f(x). Leta 
be a fixed point of f. Show that a is an attractive fixed point of f if and only if 
| f’(a)| < 1. Generalize this result to cycles. Note that if {a;, a2, ...,a)} is a cycle 
of period p, then #(f°?(x)|z=a, = f’(ai) f'(a2)... f’(ap). Assure yourself that the 
situation is correctly summarized in the web diagram shown in Figure IV.117. 


5.9. Consider the dynamical system {[0, 1]; f(x)} where 
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—2x when x € (0, 5], 
x—1 whenxe [5 1]. 


pay = |. 


Consider also the just-touching IFS {[0, 1], $x + 5, —3x + 5}. Show that it is possi- 
ble to define a “shift transformation,” S, on the attractor, A, of this IFS in such a way 
that {[0, 1]; S} and {[0, 1]; f(x)} are equivalent dynamical systems. To do this you 
should define S : A — A in the obvious manner for points with unique addresses; 
and you should make a suitable definition for the action of S on points with multiple 
addresses. 


5.10. Let {R*; w;, w2, w3} denote a one-parameter family of IFS, where 


“(3)=(% ag) (G) 
Ny) \ 0) (AB) Ay) 

x\ (C42) 0 \(x ate 
w(5)=( 0 oss) (5 )eCF )- 
x ky 0 x Ae 

w(5)=( 0 ey) (5) +( 0 ) for p10, 0 


Let the attractor of this IFS be denoted by A(p). Show that A(O) is a Cantor set 
and A(1) is a Sierpinski triangle. Consider the associated family of code space maps 
$(p): & — A(p). Show that ¢(p)(c) is continuous in p for fixed o € &; that is 
o(p)(ao) : [0, 1] > IR? is a continuous path. Draw some of these paths, including 
ones that meet at p = 1. Interpret these observations in terms of the Cantor set be- 
coming “joined to itself” at various points to make a Sierpinski triangle, as suggested 
in Figure IV.118. 

Since the IFS is totally disconnected when p = 0, @(p = 0): & — A(O) is in- 
vertible. Hence we can define a continuous transformation @ : A(O) — A(1) by 
6(x) = o(p = 1)(@|(p = 0)(x)). Show that if we define a set J(x) = {y € A(O): 
6(y) =x} for each x € A(1), then J(x) is the set of points in A(0) whose associated 
paths meet at x € A(1) when p = 1. Invent shift dynamics on paths. 
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Our goal in this section is to extend the definition of the shift dynamical system 
associated with a totally disconnected hyperbolic IFS to cover the just-touching and 
overlapping cases. This will lead us to the idea of a random shift dynamical system 
and to the discovery of a beautiful theorem. This theorem will be called the Shadow 
Theorem. 

Let {X; w1, w2,..., wy} denote a hyperbolic IFS, and let A denote its attractor. 
Assume that w, : A > A is invertible for each n = 1,2,..., N, but that the IFS is 


150 Chapter!lV Chaotic Dynamics on Fractals 


Figure IV.118. Con- 
tinuous transformation of 
a Cantor set into a Sierpin- 
ski triangle. The inverse 
transformation would in- 
volve some ripping. 
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L Figure IV.119. The 
two possible shift dynam- 
ical systems associated 
with the just-touching IFS 
{[0, 1]; 3x, 5x + 3} are 
represented by the two 
possible graphs of S(x). 
“Most” orbits are unaf- 
fected by the difference 

0 1 between the two systems. 


not totally disconnected. We want to define a dynamical system {A; S} analogous to 
the shift dynamical system defined earlier. Clearly, we should define 


S(x) = w7'(x) when x € w,(A), but x ¢ w»,(A) form £n, 


for each = 1, 2, ..4< 1. 

However, at least one of the intersections w,(A) M w,(A) is nonempty for some 
m #n. One idea is simply to make an assignment of which inverse map is to be 
applied in the overlapping region. For the case N = 2 we might define, for example, 


wy (x) when x € w;(A), 
w;'(x) whenx € A \ w)(A). 


In the just-touching case the assignment of where S takes points that lie in the 
overlapping regions does not play a very important role: only a relatively small 
proportion of points will have somewhat arbitrarily specified orbits. We look at some 
examples, just to get the flavor. 


six) =| 


Examples & Exercises 
6.1. Consider the shift dynamical — associated with the IFS 


{[0, 1]; a ar+5 5}: 


We have S(x) = 2x for x €[0, $) and S(x) = 2x —1 for x € (5, 1]. We can define 
the value of S (5) to be either 1 or 0. The two possible graphs for S(x) are shown in 
Figure IV.119. The only points x € [0, 1] = A whose orbits are affected by the defi- 
nition are those rational numbers whose binary expansions end ...0111 or... 1000, 
the dyadic rationals. 


6.2. Show that if we follow the ideas introduced above, there is only one dynami- 
= system {A; 5S} that can be associated with the just-touching IFS {[0, 1]; —; 
1 1x}. The key here is that wy '(x) = w3'(x) for all x € w;(A) N w2(A). 


6.3. Consider some possible “shift” dynamical systems {A; S} that can be associ- 
ated with the IFS 
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Figure IV.120. Two 
possible shift dynamical 
systems that can be associ- 
ated with the overlapping 
IFS {[0, 1]; 5x, ox + i}. 
In what ways are they 
alike? 


j ' 
{C; 2 a. ae 


The attractor, A, is overlapping at the three points a = w;(A)M w2(A), b= 
w2(A) 1 w3(A), and c= we) Mw (A). We ise define ©) — w) (a) or 
w;'(a), S(b) = w;'(b) or w;'(b), and S(c) = w;'(c) or w; '(c). Show that re- 
gardless of which definition is ae the orbits of a, b, and c are eventually periodic. 


6.4. Consider a just-touching IFS of the form {R?; w;, w2, w3} whose attractor is 
an equilateral Sierpinski triangle A. Assume that each of the maps is a simili- 
tude of scaling factor 0.5. Consider the possibility that each map involves a rota- 
tion through 0°, 120°, or 240°. The attractor, A, is overlapping at the three points 
a=w (A) w2(A), b = w2(A) MN w3(A), and c= —_ M w,(A). Show that it 
is possible to choose the maps so that w, l(a)= W> l(a), W> l(b) = W3 >! (b), and 
W3 c)= WwW, le), 

6.5. Is code space on two symbols topologically equivalent to code space on three 
symbols? Yes! Construct a homeomorphism that establishes this equivalence. 


6.6. Consider the hyperbolic IFS {%; t), t2,...,tv}, where & is code space on N 
symbols {1,2,..., N} and 

t,o =no for allo € &. 
Show that the associated shift dynamical system is exactly {; 7} defined in Theo- 
rem 4.5.1. Can two such shift dynamical systems be equivalent for different values 


of N? To answer this question consider how many fixed points the dynamical system 
{x; T} possesses for different values of N. 


6.7. Consider the overlapping hyperbolic IFS {[0, 1); Ax, 3x + i}. Compare the 
two associated shift dynamical systems whose graphs are shown in Figure IV.120. 
What features do they share in common? 


6.8. Demonstrate that code space on two symbols is not metrically equivalent to 
code space on three symbols. 


In considering exercises such as 6.7, where two different dynamical systems are 
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OVERLAPPING Figure IV.121. A par- 
NON-OVERLAPPING NON-OVERLAPPING tially random and par- 
: oe tially deterministic shift 
dynamical system as- 
oo sociated with the IFS 
fF {(0, 1]; $x, 3x + 4}. 


UNIQUE RANDOM UNIQUE 
DYNAMICS DYNAMICS DYNAMICS 


associated with an IFS in the overlapping case, we are tempted to entertain the idea 
that no particular definition of the shift dynamics in the overlapping regions is to 
be preferred. This suggests that we define the dynamics in overlapping regions in 
a somewhat random manner. Whenever a point on an orbit lands in an overlapping 
region we should allow the possibility that the next point on the orbit is obtained by 
applying any one of the available inverse transformations. This idea is illustrated in 
Figure IV.121, which should be compared with Figure I'V.120. 


Definition 6.1 Let {X; w,, w2} be a hyperbolic IFS. Let A denote the attractor 
of the IFS. Assume that both w; : A —> A and wz: A — A are invertible. A sequence 
of points {xXn}°2, in A is called an orbit of the random shift dynamical system 
associated with the IFS if 


0, 1a_) when x, € w1(A) and x, ¢ w1(A) M w2(A), 
Xn = 1.305 Gy) when x, € w2(A) and x, ¢ w1(A) MN w2(A), 
one of {w)!(%n), Wy (Xn)} when x, € w1(A) NM w2(A), 
for eachn € {0, 1,2, ...}. We will use the notation x, = S(x,) although there may 
be no well-defined transformation S : A — A that makes this true. Also we will write 
{A; S} to denote the collection of possible orbits defined here, and we will call {A; S} 
the random shift dynamical system associated with the IFS. 
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Notice that if w;(A) M w2(A) = @ then the IFS is totally disconnected and the 
orbits defined here are simply those of the shift dynamical system {A; 5S} defined 
earlier. 

We now show that there is a completely deterministic dynamical system acting 
on a higher-dimensional space, whose projection into the original space X yields the 
“random dynamics” we have just described. Our random dynamics are seen as the 
shadow of deterministic dynamics. To achieve this we turn the IFS into a totally 
disconnected system by introducing an additional variable. To keep the notation 
succinct we restrict the following discussion to IFS’s of two maps. 


Definition 6.2 The lifted IFS associated with a hyperbolic IFS {X; w,, w2} is 
the hyperbolic IFS {X x X; Ww, W2}, where & is the code space on two symbols 
{1,2} , and 


W(x, 0) = (w(x), lo) for all (x,o) EX x &; 
W2(x, 0) = (w2(x), 20) for all (x,0) € X x &. 
What is the nature of the attractor A C X x & of the lifted IFS? It should be clear 
that 
A={x€A:(x,0) € A}and D ={o €D: (x,0) € A}. 


In other words, the projection of the attractor of the lifted IFS into the original space 
X is simply the attractor A of the original IFS. The projection of A into & is D. 
Recall that & is equivalent to a classical Cantor set. This tells us that the attractor of 
the lifted IFS is totally disconnected. 


Lemma 6.1 Let {X; w;, w2} be a hyperbolic IFS with attractor A. Let the two 
transformations w,;: A—> A and w,.: A— A be invertible. Then the associated 
lifted IFS is hyperbolic and totally disconnected. 


Definition 6.3 Let {X; w,, w2} be a hyperbolic IFS. Let the two transformations 
w,:A— Aand w,:A— A be invertible. Let A denote the attractor of the associ- 
ated lifted IFS. Then the shift dynamical system {A; S} associated with the lifted IFS 
is called the lifted shift dynamical system associated with the IFS. 


Notice that 
S(x,0) =(wz'(x),T(o)) for all(X,0) € A, 
where 
T (01020304...) = 02030405... for all o = 01020304....€ . 
Theorem 6.1 /(The Shadow Theorem).] Let {X; w, w2} be a hyperbolic IFS of 
invertible transformations w, and w2 and attractor A. Let {xn}-_, be any orbit of the 


associated random shift dynamical system {A; S} . Then there is an orbit {x,}°_) of 
the lifted dynamical system {A; S} such that the first component of Xp is x, for all n. 
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Cantor set of 


SEEN infinitesimal 
FROM leaflets 
kine grouped 

in four 
THE Eo : 
SET 
1S Each “leaflet” 
TOTALLY 2.3, is a microcosm 
DISCONNECTED <*> 4, of the whole 


4 leaflet stack 


THE SHADOW OF THE CANTOR SET 
IS A LEAF, THE ATTRACTOR OF AN IFS. 


We leave the proofs of Lemma 6.1 and Theorem 6.1 as exercises. It is fun, how- 


ever, and instructive to look in a couple of different geometrical ways at what is 
going on here. 


Examples & Exercises 
6.9. Consider the IFS {C; w;(z), w2(z), w3(z), wa(z)} where, in complex notation, 
w1(z) = (0.5)(cos 45° — /—1 sin 45°)z + (0.4 — 0.2V—1), 
w2(z) = (0.5)(cos 45° + /—1 sin 45°)z — (0.4 + 0.2V—1), 
w3(z) = (0.5)z + V—1(0.3), 
wa(z) = (0.5)z — /—1(0.3). 


A sketch of its attractor is included in Figure IV.122. It looks like a maple leaf. The 
leaf is made of four overlapping leaflets, which we think of as separate entities, at 
different heights “above” the attractor. In turn, we think of each leaflet as consisting 
of four smaller leaflets, again at different heights. One quickly gets the idea: one 
ends up with a set of heights distributed on a Cantor set in such a way that the 
shadow of the whole collection of infinitesimal leaflets is the leaf attractor in the 
C plane. The Cantor set is essentially ©. The lifted attractor is totally disconnected; 
it supports deterministic shift dynamics, as illustrated in Figure IV.123. 


Figure IV.122. The lift 
of the overlapping leaf 
attractor is totally dis- 
connected. Deterministic 
shift dynamics become 
possible. See also Fig- 
ure [V.123. 
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Figure IV.123. A pic- 
ture of the Shadow The- 
orem. Deterministic dy- 
namics on a totally discon- 
nected dust has a shadow 
that is dancing random 
shift dynamics on a leaf 
attractor. 


ee 


DETERMINISTIC 
SHIFT DYNAMICS 
ON THE 

LIFTED 

LEAF 


RANDOM 
SHIFT DYNAMICS 
ON THE LEAF 
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6.10. Consider the overlapping hyperbolic IFS {R; 3x, 3 2x + 3}. We can lift this to 
the hyperbolic IFS {R?; w:(x), w2(x)}, where 


w(2)=(5 9) (2)+() 


The attractor A of this lifted system is shown in Figure IV.124, which also 
shows an orbit of the associated shift dynamical system. The shadow of this orbit 
is an apparently random orbit of the original system. The Shadow Theorem asserts 
that ial orbit Unn=o of a random shift dynamical system associated with the IFS 
{R; 5x, 3x oe +} is the projection, or shadow, of some orbit for the shift dynamical 
system associated with the lifted IFS. 


6.11. As a compelling illustration of the Shadow Theorem, consider the IFS 


{R; sm, arta opis). 
Let us look at the orbits {x,}°2, of the shift tas system specified in the 
left-hand graph of Figure IV.120. In this case we always choose S(x) = w, 1x) 
in the overlapping region. What orbits {X,}°2, of the lifted system, described in 


exercise 6.7, are these orbits the shadows of? Look again at Figure [V.124! Define 
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OVERLAPPING 
A REGION 


A looks like 
a Classical 

Cantor Set 
when seen 
from the 
side. 


the top of A as 
Atop ={(x, y)€A:(Z, y)€ASz<x, and ye (0, 1}}. 


Notice that S$: Avs “> ee It is easy to see that there is a one-to-one correspon- 
dence between orbits of the lifted system {A top; S } and orbits of the original system 
specified through the left-hand graph of Figure IV.120. Indeed, 

{(Xn; Yn) }7 is an orbit of the lifted system and (xo, yo) € Ass 


t 
{Xn}P-.9 is an orbit of the left-hand graph of Figure IV.120 


6.12. Draw some pictures to illustrate the Shadow Theorem in the case of the just- 
touching IFS {[0, 1]; 5x, 5x + 5}. 


6.13. Illustrate the Shadow Theorem using the overlapping IFS {[0, 1]; —3x “ 
3, 3x + t}. Find an orbit of period 2 whose lift has minimal period 4. Do there exist 
periodic orbits whose lifts are not periodic? 


6.14. Prove Lemma 6.1. 
6.15. Prove Theorem 6.1. 


Figure !V.124. The 
Shadow Theorem asserts 
that the random shift 
dynamical system orbit on 
the overlapping attractor 
A is the shadow of a 
deterministic orbit on A. 
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6.16. The IFS {Z; w;(c), ..., ww(o)} given by 
w,(o) =no 
for each n = 1,2,..., N, has an interesting lift. Show that the lift of this IFS, with 


a suitably defined inverse, is the shift automorphism on the space of shifts and 
therefore equivalent to the baker’s transformation. 


6.17. In section 5 it was shown that the associated shift dynamical system of any 
totally disconnected IFS is equivalent to the shift transformation on code space. 
Then we may replace the second map in the lift for the Shadow Theorem with such 
a totally disconnected IFS. That is, we could take a map like the leaf shown in 
Figures IV.122 and IV.123, and define the map 


(IR? x A; w(x, y),... wax, y)}, 


where w; = (w;_ Ze 2 y), vj(x, y)), Where v; are the maps of the totally disconnected 
IFS 


0 
1/3 0 x 1 
- 
ee ee y 0 
1f3- 0 x 4 1 
Dass y Se 
: 4. fe 8 x 0 
aU va) (G)*(a): 
Since this IFS produces an attractor that is totally disconnected, and therefore a copy 


of code space, the resulting lift is totally disconnected. What would a rendition of the 
lifted system look like if the maple leaf were lifted using a totally disconnected tree? 


eed a ee ee 
7 The Meaningfuiness of Inaccurately Computed Orbits Is Established by 


Means of a Shadowing Theorem 


Let {X; w, W2,..., wy} be a hyperbolic IFS of contractivity 0 <s < 1. Let A de- 
note the attractor of the IFS, and assume that w, : A —> A is invertible for each 
n=1,2,..., N. If the IFS is totally disconnected, let {A; S} denote the associated 
shift dynamical system; otherwise let {A; 5} denote the associated random shift dy- 
namical system. Consider the following model for the inaccurate calculation of an 
orbit of a point x9 € A. This model will surely describe the reader’s experiences 
in computing shift dynamics directly on pictures of fractals. Moreover, it is a rea- 
sonable model for the occurrence of numerical errors when machine computation is 
used to compute an orbit. 
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Plate | 


Ice in trees exhibits natural affine redundancy. 


Plate 2 ! ? 
The same IFS with two different coloring schemes: color is determined by the 


Hausdorff distance from the fern attractor. 


. Plate 3 
Three-dimensional ferns. 


Plate 4 

A truly four-dimensional fern. As we rotate 
this weird object in the three dimensions in 
which we live, what will we see emerging 
from the fourth dimension? Computed 
using 4-D Fernmaker™. 


: Plate 5 

This sequence of images illustrates the 
continuous dependence of the attractor on 
the parameters in the IFS code. 
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Plate 6 
The result of running a version of Program 
] in Chapter VII on a Masscomp 5600 workstation 


with Aurora graphics. 


Plate 7 

The result of running a version of Program 1 in 
Chapter VII, with (a, b) = (0, 0), (c, d) 

=(5 X 10", 5X 10"), 

and numits = 65. This viewing window is minute, 
the colors arranged in a wonderful pattern: we are 
stunned by the geometrical complexity of this 
image at such high magnification. 


Plate 8 
A seed beginning to germinate? 


Ee 


Plate 9 , 
~The Escape Time Algorithm applied 

to an IFS of affine transformations. — 

Such images can be magnified 
enormously, without loss of mystery. 


Plate 10 


Primitive art? 


Plate 11 

The Escape Time Algorithm is applied to an overlapping 
IFS of two affine transformations. Can such sets be used 
to model sections of plants, as can be seen under a 
microscope? 


Plate 12 

_ The Escape Time Algorithm is applied to an 
IFS of two maps. Here the magnification is ex- 
treme. Do not race by. Look closely and think. 


Plate 13 
The Escape Time Algorithm is applied to pro- 
duce an image of the Julia set for z? - 1. 
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Plate 14 


The Escape Time Algorithm is used to analyze the 
Newton transformation associated with the polynomial 
z'— 1. Here the Riemann sphere is mapped onto a 
rectangle using Mercator's projection. The North Pole 
at the top corresponds to the Point at Infinity, while 
the South Pole, at the bottom, corresponds to the 
Origin. The roots are a, = 1, a,=-1, a,=i, 

a,=~i. Points whose orbits converge to a, 

are plotted in dark green; points whose orbits 
converge to a, are plotted in very light green; 

points whose orbits converge to a, are plotted 

in light green; and points whose orbits con- 

verge to a, are plotted in white. 


Plate 15 

Zoom in on a piece of Plate 14 and change the 
color scheme. Every point on a boundary between 
two colors is actually on the boundary of all four 
colors! 


Plate 16 

Part of the Mandelbrot set 
for z>—X, painted in 
colors according to 
“escape times." The 
green land represents val- 
ues of the complex 
parameter A for which 

the associated Julia set is 
connected. The blue sea 
corresponds to totally dis- 
connected Julia sets. The. 
exciting place is the 
coastline, where the white 
foam of the breaking 
waves symbolizes the 
transition between con- 
nection and disconnec- 
tion. 


Plate 17 | 
Part of the parameter space P = {A €C:lAl <1} 
for the family of dynamical systems 

_ f(z—- IA if Rez 20; 
re= (z+ 1X if Rez <0. 
ordered according to the “escape time" of a point 
Oe R?. 


Plate 19 7 

A map of a small piece of the parameter space for 
the family of dynamical systems in Example 4.2 
in Chapter VII. The map was computed using 
Algorithm 4.1 in Chapter VIII. The grainy 
multicolored areas resemble the repelling sets 

of the dynamical system for the corresponding 
values of X. 


Plate 18 


An extreme close-up of 
a piece of the parameter 
space in Color Plate 17. - 


Plate 20 
An extreme close-up on 
a grainy region in Color 
Plate 19. 


LS — 
Plate 21 
Chimneys in a landscape 
giving off smoke. 


Plate 22 


Zoom on the smoke com- 
ing out of a chimney. 


Plate 23 

Compare the simple, green IFS 
attractor and the corresponding 
Borel measure rendered using 
autumnal colors. 


Plate 24 


Compare the collage for a maple leaf and the corresponding Borel measure. : 


RS 


Plate 25 

A sequence of frames taken 
from an IFS encoded movie 
entitled A Cloud Study 
[Barnsley, 1987a]. This 
illustates the continous 
dependence of the invar- 
iant measure on parameters 
in the IFS code. Theorem 
11.1 in Chapter IIT makes 
movies! 


Plate 27 
Andes Girl. 


Plate 26 
Monterey Coast. 


Plate 28 
Arctic Wolf. 


Piate 30 
Sunflower Field. 


Plate 29 
Leaf and Sunflower, condensation sets used in 
Color Plate 30. 
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Zoom on Black Forest. 


Plate 32 
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Black Forest in Winter. 


Plate 31 
Black Forest 
Plate 33 


Plate 35 
A rendered Borel measure mimics 
leaping flames. 


Plate 34 
Zoom on Black Forest in Winter. 
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Plate 36 

A fern and three progressive 
zooms. It was constructed 

using a recurrent IFS. Notice 

how at one scale the fronds 

are alternating and at 

another scale they are 

opposed. 


Plate 37 

An image of a root in soil. Notice how the gravel appears 
at many scales around the root. This plate illustrates a 
measure UL € P, which is a fixed point of the Markov oper- 
ator M associated with a recurrent IFS of affine maps, 
where the maps have been carefully chosen to create this 
image. Computed using VRIFS™. 


Plate 38 

This picture of birds and trees is made up of a number of segments, 
each of which is made from a vector recurrent IFS with probabilities 
using VRIFS™. The PRESENTS™ system is then used to map the 
grayscale values for each segment into color values using a look-up 
table which is then adjusted interactively to make the image look as 
realistic as possible. | | 
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Plate 39 


A fractal tree. Computed using VRIFS™ and colored 
by PRESENTS™. 


Plate 40 

A collection of TV cameras pointing 
at TV screens illustrates the idea of 
a Vector Recurrent Iterated Function 
System. There is a Markov process 
on each “screen.” Some points are 
spawned and other trajectories die. 


Plate 41 

The transformation W is a 
contraction mapping in the 
space of vectors of pictures. 
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Pictures 
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Plate 42 

An original photograph of a cafe in 
Valencia, Spain 

was scanned to create a 24-bit per 
pixel file that was 

1,345,960 bytes in size. 


Plate 44 

The decompressed image after 
compression down 

to 30,173 bytes; a compression 
ratio of 44:1. 


Plate 43 

The decompressed image after 
compression down to 147,557 bytes; a 
compression ratio of 9:1. Compres- 
sion and decompression for Plates 43, 
44, and 45 were done using the fractal 
transform as implemented in Images 
Incorporated™ 2.0. 


Plate 45 


The decompressed image after 
compression down 

to 23,399 bytes; a compression ratio 
of 58:1. 
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7 The Meaningfulness of Inaccurately Computed Orbits 


Let an exact orbit of the point xp € A be denoted by {Xn}, Where x, = S°"(xo) 
for each n. Let an approximate orbit of the point x9 € A be denoted by {%,,}°°,, where 
Xo = Xo. Then we suppose that at each step there is made an error of at most 6 for 


some 0 < 6 < oo; that is, 
d(Xn+1, Sin) <0 fom == 01 2... 


We proceed to analyze this model. It is clear that the inaccurate orbit {x,}°° will 
usually start out by diverging from the exact orbit {x,}°°, at an exponential rate. 
It may well occur “accidentally” that d(x,, x,) is small for various large values 
of n, due to the compactness of A. But typically, if d(x,, X,) is small enough, 
then d(Xn+;, Xn4+;) will again grow exponentially with increasing j. To be precise, 
suppose d(X;, 5(Xo)) = 6 and that we make no further errors. Suppose also that for 
some integer M, and some integers 0;, 02,..., 0y € {1,2,..., N}, we have 


Xn ONGX, € Wo (A), forn =O, 1,2, ...,.M. 
Moreover, suppose that 
Xn 1 = We (iq) and Spy5 ee, Ry), for ee a 
Then we have 
WSntis inet) & SO, tern =, 12... 
For some integer J > M it is likely to be the case that 
tsi: = Ws, (x,) and 3,44 = Ws, nds for some o, #4). 


Then, without further assumptions, we cannot say anything more about the correla- 
tion between the exact orbit and the approximate orbit. Of course, we always have 
the error bound 


d(Xn, Xn) < diam(A) = max{d(x, y):x € A, y EA}, foralln = 1, 2,3,.... 


Do the above comments make the situation hopeless? Are all of the calculations 
of shift dynamics we have done in this chapter without point because they are riddled 
with errors? No! The following wonderful theorem tells us that however many errors 
we make, there is an exact orbit that lies at every step within a small distance of our 
errorful one. This orbit shadows the errorful orbit. This type of theorem is extremely 
important in dynamics, and in any class of dynamical systems that has one (such as 
IFS) behavior that can be accurately analyzed using graphics on computers. Here we 
are use the word “shadows” in the sense of a secret agent who shadows a spy. The 
agent is always just out of sight, not too far away, usually not too close, but forever 
he follows the spy. 


Theorem 7.1 The Shadowing Theorem. Let {X; w;, w2,..., wy} be a hy- 
perbolic IFS of contractivity s, where 0 < s < 1. Let A denote the attractor of the 
IFS and suppose that each of the transformations w,:A—> A is invertible. Let 
{A; S} denote the associated shift dynamical system in the case that the IFS is totally 
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disconnected; otherwise let {A; S} denote the associated random shift dynamical 
system. Let {Xn}°24 C A be an approximate orbit of S, such that 

d(Xn+1, S(Xn)) <9 for all n =0, 1, 2,3,..., 
for some fixed constant 6 with O < @ < diam(A). Then there is an exact orbit {x, = 


S°" (x9)}°2.9 for some xo € A, such that 


for alin O01 2... 


. sO 
d(Xn+1, ee < (Ts) 


Proof As usual we exploit code space! For n=1,2,3,..., let o, € {1, 


2,...,N} be chosen so that wz’, wz,', wz’, ..., is the actual sequence of inverse 


maps used to compute S{xo), S(X1), S(x2),..-. Leto: Um A denote the code space 
map associated with the IFS. Then define 


Xo = $(010203...). 
Then we compare the exact orbit of the point xo, 
{Xn = S°"(X0) = $(Ont19n42-- =o 


with the errorful orbit {%,,}°2o. 
Let M be a large positive integer. Then, since xy and S(xy-_;) both belong to A, 
we have 


d(S(xy-1), S(xm-1) < diam(A) < oo. 


Since S(xy_1) and S(Xy_,) are both computed with the same inverse map we! it 
follows that 


d(xy-1, Xm-—1) <sdiam(A). 
Hence | 
d(S(xm—2), S&m-2)) = d(xm-1, SQ@m-2)) 
< d(xm-1, Xm-1) + d(Xm-1, S(&m-2)) 
<@+=sdiam(A); 
and repeating the argument used above we now find 
d(xm-2, Xm-2)) < (0 +s diam(A)). 
Repeating the same argument k times we arrive at 
d(xm—z, Xm-z) < 50 +576 +--+ +5""10 + s* diam(A). 
Hence for any positive integer M and any integer n such that 0 <n < M, we have 
d (Xn, Xn) < 80 +570 +--»+5%"—'6 + 5" diam(A). 
Now take the limit of both sides of this equation as M — oo to obtain 
sé 


d(Xn, Xn) < SOL +s+5°+---)= 
(l=; 5) 


for alln = 1, 2,.... 


This completes the proof. 


7 The Meaningfuiness of Inaccurately Computed Orbits 


Examples & Exercises 


7.1. Let us apply the Shadowing Theorem to an orbit on the Sierpinski triangle, 
using the random shift dynamical system associated with the IFS 

i 4 i 

he 5% + > 52 + 5} 

Since the system is just-touching we must assign values to the shift transformation 
applied to the just-touching points. We do this by defining 


fc; 


S(x1 + 1x2) = 2x; mod 1 + i(2x2 mod 1). 


We consider the orbit of the point x9 = (0.2147, 0.0353). We compute the first 11 
points on the exact orbit of this point, and compare it to the results obtained when a 


deliberate error 9 = 0.0001 is introduced at each step. We obtain: 


Errorful 
Xo = (0.2147, 0.0353) 
X; = (0.4295, 0.0705) 
X2 = (0.8591, 0.1409) 
x3 = (0.7183, 0.2817) 
X4 = (0.4365, 0.5635) 
X5 = (0.8731, 0.1269) 
X6 = (0.7463, 0.2537) 
X7 = (0.4927, 0.5073) 
Xg = (0.9855, 0.0145) 
Xo = (0.9711, 0.0289) 
X19 = (0.9423, 0.0577) 


Exact 
$° (Xo) = (0.2147, 0.0353) 
S°!(%o) = (0.4294, 0.0706) 
$°2 (Xo) = (0.8588, 0.1412) 
$°3 (Xo) = (0.7176, 0.2824) 
S* (Xo) = (0.4352, 0.5648) 
S°> (Xo) = (0.8704, 0.1296) 
$°®(X9) = (0.7408, 0.2592) 
S°" (Xo) = (0.4816, 0.5184) 
S°8(X9) = (0.9632, 0.0368) 
S°° (Xo) = (0.9264, 0.0736) 
$°!9(xo) = (0.8528, 0.1472) 


Notice how the orbit with errors diverges from the exact orbit of xo. Nonetheless, 
the shadowing theorem asserts that there is an exact orbit {x,} such that 


1 
atn, Xn) < 1 2 


ae & 
2 


(0.0001) = 0.0001, 


where d(-, -) denotes the Manhattan metric. This really seems unlikely; but it must 
be true! Here’s an example of such a shadowing orbit, also computed exactly. 


Exact Shadowing Orbit xn = S" (xo) 


d(Xn, Xn) < 0.0001 


Xo = (0.21478740234375, 0.03521259765625) 0.00009 
X1 = (0.4295748046875, 0.0704251953125) 0.00008 
X2 = (0.8591496093750, 0.1408503906250) 0.00005 
x3 = (0.7182992187500, 0.28170078 12500) 0.000001 
X4 = (0.4365984375000, 0.5634015625000) 0.0001 
X5 = (0.8731968750000, 0.1268031250000) 0.0001 
X6 = (0.7463937500000, 0.2536062500000) 0.0001 
x7 = (0.4927875000000, 0.5072125000000) 0.00009 
Xg = (0.9855750000000, 0.0144250000000) 0.00008 
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Figure IV.125. The TRUE ORBIT OF ,, ene 
Shadowing Theorem tells 
us there is an exact orbit COMPUTED ORBIT OF ere 
closer to {x,} than0.03 for 4 
all n. Be Al 
. errors are less than 0.03 
True orbit of X already 
~ far from the computed orbit 
O 1 
X9 = (0.9711500000000, 0.0288500000000) 0.00005 
X19 = (0.9423000000000, 0.0577000000000) 0.000000 


Figure IV.125 illustrates the idea. 


7.2. Consider the shift dynamical system {X; T} on the code space of two symbols 
{1, 2}. Show that the sequence of points {x,} given by 


Xo =212,andx,=12  foralln=1,2,3,... 


is an errorful orbit for the system. Illustrate the divergence of T°’xo from x,. Find 
a shadowing orbit {x,}?°) and verify the error estimate provided by the Shadowing 
Theorem. 


7.3. Illustrate the Shadowing Theorem by constructing an erroneous orbit, and an 
orbit that shadows it, for the shift dynamical system {[0, 1]; zx, 5x + 5} 


7.4. Compute an orbit for a random shift dynamical system associated with the 
overlapping IFS {[0, 1]; $x, $x + 5}. 


7.5. An orbit of the shift dynamical system associated with the IFS 


(365): 36) +a) 2G) #6) 8G) +G)}. 


7 The Meaningfuiness of inaccurately Computed Orbits 


Figure 1V.126. An ex- 

act orbit shadows the orbit 
“computed” by “drawing” 
in this web diagram for 

a random shift dynamical 

system. 


0 X 1 


is computed to accuracy 0.0005. How close a shadowing orbit does there exist? Use 
the Manhattan metric. 


7.6. In Figure IV.126 an orbit of the random shift dynamical system associated with 
the overlapping IFS {[0, 1], w,(x), w2(x)} is computed by drawing a web diagram. 
The computer in this case consists of a pencil and a drafting table. Estimate the 
errors in the drawing and then deduce how closely an exact orbit shadows the plotted 
one. You will need to estimate the contractivity of the IFS. Also draw a tube around 
the plotted orbit, within which an exact orbit lies. 


7.7. Figure IV.127 shows an orbit {x,} of the random shift dynamical system associ- 
ated with the IFS {[0, 1]; w;(x), w2(x)}. It was obtained by defining S(x) = w, V(x) 
for x € w;(A) M w2(A). A contractivity factor for the IFS is readily estimated from 
the drawing to be z. Hence if the web diagram is accurate to within 1 mm at each 
iteration, that is 


d(Xn+1, S(Xn)) < 1mm, 


then there is an exact orbit {x, = S°’(xo)}°°., such that 


Thus there is an actual orbit that remains within the “orbit tube” shown in Fig- 
ure [V.127. 
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Figure 1V.127. Only 
the Shadow knows. Inside 
the “orbit tube” there is an 
exact orbit {x,}°°, of the 
random shift dynamical 
system associated with the 
IFS. 


8 Chaotic Dynamics on Fractals 


The shift dynamical system {A; S} associated with a totally disconnected hyperbolic 
IFS is equivalent to the shift dynamical system {2,7}, where © is the code space 
associated with the IFS. As we have seen, this equivalence means that the two 
systems have a number of properties in common; for example, the two systems have 
the same number of cycles of minimal period 7. A particularly important property 
that they share is that they are both “chaotic” dynamical systems, a concept that we 
explain in this section. First, however, we want to underline that the two systems are 
deeply different from the point of view of the interplay of their dynamics with the 
geometry of the underlying spaces. 
Consider the case of an IFS of three transformations. Let £ denote the code space 
of the three symbols {1, 2, 3}, and look at the orbit of the point 0 € © given by 
= 
1231112132122233132331111 
A2113421 922 1235151 132133211 212213 
22122222323 123223331131231332132 
23233313323331111111211131121112 
21d ISIS Sie isk 
11222122312311232123313 
pits ee FOREVER. 
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Figure 1V.128. The 
Start of a chaotic orbit on a 
Ternary Cantor set. 


This orbit {T°"a}°° , may be plotted on a Cantor set of three symbols, as sketched 
in Figure I[V.128. This can be compared with the orbit {5° (@ (7 ))}P- 5 of the shift 
dynamical system {A, S} associated with an IFS of three maps, as plotted in Fig- 
ure IV.129. Figure [V.130 shows an equivalent orbit, but this time for the just- 
touching IFS {[0, 1]; 3x, 3x + 4, $x + 3}, displayed using a web diagram. 

In each case the “same” dynamics look entirely different. The qualities of beauty 
and harmony present in the observed orbits are different. This is not suprising: the 
equivalence of the dynamical systems is a topological equivalence. It does not pro- 
vide much information about the interplay of the dynamics with the geometries of 
the spaces on which they act. This interplay is an open area for research. For ex- 
ample, what are the special conserved properties of two metrically equivalent dy- 
namical systems? Can you quantify the grace and delicacy of a dancing orbit on a 
fractal? 

This said, we turn our attention back to an important collection of properties 
shared by all shift dynamical systems. For simplicity we formalize the discussion for 
the case of the shift dynamical system {A, S} associated with a totally disconnected 
hyperbolic IFS. 


Definition 8.1 Let (X, d) be a metric space. A subset B C X is said to be dense 
in X if the closure of B equals X. A sequence {Xn}°° of points in X is said to be 
dense in X if, for each point a € X, there is an subsequence {Xq,}°°_ that converges 
to a. In particular an orbit {x,}P° of a dynamical system {X, f} is said to be dense 


in X if the sequence {x;,}°°, is dense in X. 


By now you will have had some experience with using the random iteration al- 
gorithm, Program 2 of Chapter III, for computing images of the attractor A of IFS 
in R*. If you run the algorithm starting from a point xp € A, then all of the com- 
puted points lie on A. Apparently, the sequences of points we plot are examples of 
sequences that are dense in the metric space (A, d). 

The property of being dense is invariant under homeomorphism : if B is dense in 
a metric space (X, d) and if 6 : X — Y is a homeomorphism, then 6(B) is dense in 
Y. If {X; f} and {Y, g} are equivalent dynamical systems under 9; and if {x,} is an 
orbit of f dense in X, then {9(x,,)} is an orbit of g dense in Y. 
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Figure IV.129. The 
start of an orbit of a de- 
terministic shift dynami- 
cal system. This orbit is 
chaotic. It will visit the 
part of the attractor inside 
each of these little circles 
infinitely many times. 


Figure 1V.130. Equiv- 
alent orbit to the one 

in Figures [V.128 and 
IV.129, this time ploted 
using a web diagram. The 
starting point has address 
12394121321222331 5... 
This manifestation of an 
orbit, which goes arbi- 
trarily close to any point, 
takes place on a just- 
touching attractor. 


I'll visit you again and again! 


8 Chaotic Dynamics on Fractals 


Definition 8.2 A dynamical system {X, f} is transitive if, whenever U and V are 
open subsets of the metric space (X, d), there exists a finite integer n such that 


UN fF" (V) FO. 


The dynamical system {[0, 1]; f(x) = min{2x, 2 — 2x}} is topologically transi- 
tive. To verify this just let 4 and V be any pair of open intervals in the metric 
space ({0, 1], Euclidean). Clearly, each application of the transformation increases 
the length of the interval U/ in such a way that it eventually overlaps V. 


Definition 8.3 The dynamical system {X; f} is sensitive to initial conditions if 
there exists 5 > 0 such that, for any x € X and any ball B(x, €) with radius € > 0, 
there is y € B(x, €) and an integer n > 0 such that d(f°"(x), f°"(y)) > 4. 


Roughly, orbits that begin close together get pushed apart by the action of the 
dynamical system. For example, the dynamical system {[0, 1]; 2x mod 1} is sensitive 
to initial conditions. 


Examples & Exercises 
8.1. Show that the rational numbers are dense in the metric space (IR, Euclidean). 


8.2. Let C(n) be a counting function that counts all of the rational numbers that lie 
in the interval [0, 1]. Let 7c¢(,) denote the nth rational number in [0, 1]. Prove that the 
sequence of real numbers {rcq) € [0, 1]: = 1, 2, 3, ...} is dense in the metric space 
({0O, 1], Euclidean). 

8.3. Consider the dynamical system {[0, 1]; f(x) = 2x mod 1}. Find a point xo € 
[0, 1] whose orbit is dense in [0, 1]. 

8.4. Show that the dynamical system {[0, 00) : f(x) = 2x} is sensitive to initial 
conditions, but that the dynamical system {[0, 00) : f(x) = (0.5)x} is not. 

8.5. Show that the shift dynamical system {X; T}, where & is the code space of two 
symbols, is transitive and sensitive to initial conditions. 


8.6. Let {X, f} and {Y, g} be equivalent dynamical systems. Show that {X, f} is 
transitive if and only if {Y, g} is transitive. In other words, the property of being 
transitive is preserved between equivalent dynamical systems. 


Definition 8.4 A dynamical system {X, f} is chaotic if 
(1) it is transitive; 

(2) it is sensitive to initial conditions; 

(3) the set of periodic orbits of f is dense in X. 


Theorem 8.1 The shift dynamical system associated with a totally disconnected 
hyperbolic IFS of two or more transformations is chaotic. 


Sketch of Proof: First one establishes that the shift dynamical system {Z; T} is 
chaotic where & is the code space of N symbols, with N > 2. One then uses the code 
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space map @ : 4 — A to carry the results over to the equivalent dynamical system 
{A; S}. 

Theorem 1 applies to the lifted IFS associated with a hyperbolic IFS. Hence the 
lifted shift dynamical system associated with an IFS of two or more transformations 
is chaotic. In turn this implies certain characteristics to the behavior of the projection 
of a lifted shift dynamical system, namely a random shift dynamical system. 

Let us consider now why the random iteration algorithm works, from an intuitive 
point of view. Consider the hyperbolic IFS {R*; w;, w2}. Let a € A; suppose that the 
address of a is o € &, the associated code space. That is 


a=d(o). 


With the aid of a random-number generator, a sequence of one million ones and 
twos is selected. For example, suppose that the the actual sequence produced is the 
following one, which has been written from right to left, 


21... 12121121121211121112111111211211121111211212122211 


By this we mean that the first number chosen is a 1, then a 1, then three 2’s, and so 
on. Then the following sequence of points on the attractor is computed: 


a=¢(o) 
w(a) = (10) 
w1 0 wi(a) = d(110) 
W20W,;0Ww (a) = ¢(211c) 
W2 0 W20 WwW; 0 Ww; (a) = (22110) 
W2 0 W20 W2 0 W; OW) (a) = (222110) 
W1 0 W20W20W20 w; OW) (a) = (1222110) 
W2 0 W} OW20 W20W20 WwW; OW) (a) = (21222110) 
W1 0 W720 W, 0W20 W20W2 0 W OW; (2) = (121222110) 
W2 0 W] 0 W720 W] OW20 W720 W20 W OW (2) = (2121222110) 
W1 0 W290 W] O W209 WO W720 W20 W200 W] OW) (a) = (121212221160) 


W1 OW O W2 0 W) O W2 0 W) O W2 0 W2 0 W2 OW} O Wi (a) = (1121212221190) 


W20W10...W1 OW) 0W20 W] O W2 0 W] OW20 W20W20 W; OW; (a) = O(21... 112121222110) 


We imagine that instead of plotting the points as they are computed, we keep a list of 
the one million computed points. This done, we plot the points in the reverse order 


8 Chaotic Dynamics on Fractals 


from the order in which they were computed. That is, we begin by plotting the point 
o(21...112121222110) and we finish by plotting the point ¢(0). What will we 
see? We will see a million points on the orbit of the shift dynamical system {A; 5}; 
namely, {S°"(@ (21... 112121222110))}-0™. 

Now from our experience with shift dynamics and from our theoretical knowl- 
edge and intuitions what do we expect of such an orbit? We expect it to be chaotic 
and to visit a widely distributed collection of points on the attractor. We are looking 
at part of a “randomly chosen” orbit of the shift dynamical system; we expect it to 
be dense in the attractor. 

For example, suppose that you are doing shift dynamics on a picture of a totally 
disconnected fractal, or a fern. You should be convinced that by making sly adjust- 
ments in the orbit at each step, as in the Shadowing Theorem, you can most easily 
coerce an orbit into visiting, to within a distance € > 0, each point in the image. But 
then the Shadowing Theorem ensures that there is an actual orbit close to our artifi- 
cial one, and it too goes close to every point on the fractal, say to within a distance of 
2€ of each point on the image. This suggests that “most” orbits of the shift dynamical 
system are dense in the attractor. 


Examples & Exercises 


8.7. Make experiments on a picture of the attractor of a totally disconnected hyper- 
bolic IFS to verify the assertion in the last paragraph that “by making sly adjustments 
in an orbit ... you can most easily coerce the orbit into visiting to within a distance 
€ > 0 of each point in the image.” Can you make some experimental estimates of 
how many orbits go to within a distance € > 0, for several values of €, of every point 
in the picture? One way to do this might be to work with a discretized image and to 
try to count the number of available orbits. 


8.8. Run the Random Iteration Algorithm, Program 2 in Chapter III, to produce an 
image of a fractal, for example a fern without a stem as used in Figure IV.129. As 
the points are calculated and plotted, keep a list of them. Then plot the points over 
again in reverse order, this time making them flash on and off on the picture of the 
attractor on the screen, so that you can see where they land. This way you will see 
the interplay of the geometry with the shift dynamics on the attractor. See if the orbit 
is beautiful. If you think that it is, try to make your impression objective. 


We want to begin to formulate the idea that “most” orbits of the shift dynamical 
system associated with a totally disconnected IFS are dense in the attractor. The 
following lemma counts the number of cycles of minimal period p. 


Lemma 8.1 Let {A; S} be the shift dynamical system associated with a totally 
disconnected hyperbolic IFS {X; w;, w2,..., wy}. Let N(p) denote the number of 
distinct cycles of minimal period p, for p € {1, 2,3, ...}. Then 
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p-1 
N(p) =| N? -— kN (k) | /p for. pS 125363: 


k=] 
k divides p 


Proof It suffices to restrict attention to code space, and to give the main idea, 
consider only the case N = 2. For p = 1, the cycles of period 1 are the fixed points 
of 7. The equation 


To =~o0 €L 


implies 0 = 1111 or o = 2222. Thus N(1) = 2. For p =2, any point that lies on a 
cycle of period 2 must be a fixed point of T°”, namely 


To =0, 


where o = 11, 12, 21, or 22. The only cycles here that are not of minimal period 2 
must have minimal period 1. Furthermore, there are two distinct points on a cycle of 
minimal period 2, so 


N (2) = (2? — N(1))/2 = 2/2 = 1. 


One quickly gets the idea. Mathematical induction on p completes the proof for 
Ns 2: 

For N = 2, we find, for example, N(2) = 1, N(3) =2, N(4) = 3, N(5) = 
6, N(6) =9, N(7) = 18, N(8) = 30, N(9) = 56, N(10) = 99, N(11) = 186, 
N (12) = 335, N(13) = 630, N(14) = 1161, N(15) = 2182, A/(16) = 4080, 
N (17) = 7710, N(18) = 14532, M(19) = 27594, MN (20) = 52377. In particular, 
99.9% of all points lying on cycles of period 20 lie on cycles of minimal period 20. 

Here is the idea we are getting at. We know that the set of periodic cycles are 
dense in the attractor of a hyperbolic IFS. It follows that we may approximate the 
attractor by the set of all cycles of some finite period, say period 12 billion. Thus we 
replace the attractor A by such an approximation A, which consists of 2}2:900.000,000 
points. Suppose we pick one of these points at random. Then this point is extremely 
likely to lie on a cycle of minimal period 12 billion. Hence the orbit of a point 
chosen “at random” on the approximate attractor A is extremely likely to consist 
of 12 billion distinct points on A. 

In fact one can show that a statistically random sequence of symbols contains 
every possible finite subsequence. So we expect that the set of 12 billion distinct 
points on A is likely to contain at least one representative from each part of the 
attractor! 


Chapter V 
Fractal Dimension 


1 Fractal Dimension 


How big is a fractal? When are two fractals similar to one another in some sense? 
What experimental measurements might we make to tell if two different fractals may 
be metrically equivalent? What is the same about the two fractals in Figure V.131? 

There are various numbers, associated with fractals, which can be used to com- 
pare them. They are generally referred to as fractal dimensions. They are attempts 
to quantify a subjective feeling we have about how densely the fractal occupies the 
metric space in which it lies. Fractal dimensions provide an objective means for 
comparing fractals. 

Fractal dimensions are important because they can be defined in connection with 
real-world data, and they can be measured approximately by means of experiments. 
For example, one can measure the “fractal dimension” of the coastline of Great 
Britain; its value is about 1.2. Fractal dimensions can be attached to clouds, trees, 
coastlines, feathers, networks of neurons in the body, dust in the air at an instant in 
time, the clothes you are wearing, the distribution of frequencies of light reflected by 
a flower, the colors emitted by the sun, and the wrinkled surface of the sea during a 
storm. These numbers allow us to compare sets in the real world with the laboratory 
fractals, such as attractors of IFS. 

We restrict attention to compact subsets of metric spaces. This fits well with the 
idea of modelling the real physical world by subsets of metric spaces. Suppose that 
an experimentalist is studying a physical entity, and he wishes to model this entity 
by means of a subset of R°. Then he can use a compact set for his model. For 
example, he can assume that the distances he measures are Euclidean distances, 
and he can assume that the universe is bounded. He can assume that any Cauchy 
sequence of points in his model set converges to a point in his model set, because 
he cannot experimentally invalidate this assumption. Although mathematically we 
can distinguish between a set and its closure, we cannot make the same distinction 
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Figure V.131. Do the 
two implied fractals have 
the same dimension? 


between their physical counterparts. The assumption of compactness will allow the 
model to be handled theoretically with relative ease. 

Let (X, d) denote a complete metric space. Let A € H(X) be a nonempty com- 
pact subset of X. Let e > 0. Let B(x, €) denote the closed ball of radius € and center 
at a point x € X. We wish to define an integer, (A, €), to be the least number of 
closed balls of radius € needed to cover the set A. That is 


N (A, €) = smallest positive integer M such that A C af —1B(Xn, €), 


for some set of distinct points {x, :n =1,2,...,M}C X. How do we know that 
there is such a number (A, €)? Easy! The logic is this: surround every pointx € A 
by an open ball of radius € > 0 to provide a cover of A by open sets. Because A is 
compact this cover possesses a finite subcover, consisting of an integer number, say 
M, of open balls. By taking the closure of each ball, we obtain a cover consisting 
of M closed balls. Let C denote the set of covers of A by at most M closed balls 
of radius €«. Then C contains at least one element. Let f : C > {1,2,3,...,M } be 
defined by f (c) = number of balls in the cover c € C. Then { f(c):c €C} isa finite 
set of positive integers. It follows that it contains a least integer, V'(A, €). 

The intuitive idea behind fractal dimension is that a set A has fractal dimension 
D tf: 


1 Fractal Dimension 


N (A, €) © Ce~? for some positive constant C. 


Here we use the notation “~” as follows. Let f(€) and g(€) be real valued functions 
of the positive real variable «. Then f(€) © g(€) means that lim,_,o{In(f (€))/ In 
(g(€))} = 1. If we “solve” for D we find that 


_ InN(A,€) —InC 
In(1/e) 
We use the notation In(x) to denote the logarithm to the base e of the positive real 


number x. Now notice that the term In C/1n(1/e€) approaches 0 as e — 0. This leads 
us to the following definition. 


D 


Definition 1.1 Let A € H(X) where (X, d) is a metric space. For each e > 0 let 
N(A, €) denote the smallest number of closed balls of radius « > 0 needed to cover 
A. If 


D= lim 


e—0 


In(V (A, €)) 
In(1/e) 

exists, then D is called the fractal dimension of A. We will also use the notation 

D = D(A) and will say “A has fractal dimension D’’ 


Examples & Exercises 


1.1. This example takes place in the metric space (R*, Euclidean). Let a € X and 
let A = {a}. A consists of a single point in the space. For each € > 0, (A, €) = 1. 
It follows that D(A) = 0. 


1.2. This example takes place in the metric space (R*, Manhattan). Let A denote 
the line segment [0, 1]. Let « > 0. Then it is quite easy to see that V(A,€) = 
—[—1/e], where [x] denotes the integer part of the real number x. In Figure V.132 
we have plotted the graph of In(\V(A, €)) as a function of In(1/e). Despite a rough 
start, it appears clear that 


lim {| er 
e—0 In(1/e) 
In fact for 0 <€e < 1 
In(1/e) = In(—[—1/e]) 3) In(V (A, €)) 2 In(1/e + 1) 2 In(1 + €) + In(1/e) 
In(1/e)~ ~=In(1/e) In(i/e) ~~ In(i/e) In(1 /e) 
Both sides here converge to 1 as € — 0. Hence the quantity in the middle also 


converges to 1. We conclude that the fractal dimension of a closed line segment is 
one. We would have obtained the same result if we had used the Euclidean metric. 


1.3. Let (X, d) be a metric space. Let a, b, c € X, and let A = {a, b, c}. Prove that 
D(A) = 0. 


The following two theorems simplify the process of calculating the fractal dimen- 
sion. They allow one to replace the continuous variable € by a discrete variable. 
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Figure V.132. Plot of 
In({1/x]) as a function of 
In(1/x). This illustrates 

that in the computation Ln({1/X]) 


of the fractal dimension 

one usually evaluates 3 
the limiting “slope” of a 
discontinuous function. In 

the present example this 2 
slope is 1. 


Ln(1/X) ——-> 


~ 
— 


1 2 3 4 


Theorem 1.1 Let A € H(X), where (X, d) is a metric space. Let €, = Cr" for 
real numbers 0 <r < 1andC >0, and integers n = 1,2,3,.... Tf 


[a En)) 


D= i 
soo | In(1/€n) 


n—>oo 
then A has fractal dimension D. 


Proof Let the real numbers r and C, and the sequence of numbers E = {€,:n = 
1,2, 3, ...} be as defined in the statement of the theorem. Define f(€) = max{e, € 
E :€, <€}. Assume that e <r. Then 


fe) <€ < f(©)/r and N(A, f(e)) > N(A, €) => N(A, f(e)/r). 


Since In(x) is an increasing positive function of x for x > 1, it follows that 


eater “ {eee (a) 
In(1/f (€)) ~ { Ind/e) 
: [=< (A, ot. (2) 
In(r/f (€)) 


Assume that (A; €) > oo as € — 0; if not then the theorem is true. The right-hand 
side of equation 2 obeys 


, [| — | 
im ¢ ————————_ } = lim { ——————_- 
«0 In(r/f (€)) n—>00 In(r/€n) 

kim In(V (A, €n)) 
na | in) + InG/en) 
at —_ é)) | 

ms In(1/én) J 
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The left-hand side of equation 2 obeys 


(e f(e)/r)) Jaa [a j10 
e0 In(1/f (€)) In(1/€n) 
= In (A, €n-1)) | 
n—oo | In(1/r) + In(1/€,—-1)) 
ee peer én)) 3 
In(1/€n) 
So as € —> 0 both the left-hand side and the right-hand side of equation 2 approach 


n—> oo 


n->©O 


the same value, claimed in the theorem. By the Sandwich Theorem of calculus, the — = = — 
xes of side (1/2”) 


limit as € — 0 of the quantity in the middle of equation 2 also exists, and it equals cover R?. Here n = 2. See 
the same value. This completes the proof of the theorem. Theorem 1.2. 


Theorem 1.2 The Box Counting Theorem. Let A ¢ H(R”), where the Eu- 
clidean metric is used. Cover R™ by closed square boxes of side length (1/2"), as 
exemplified in Figure V.133 for n = 2 and m = 2. Let N,,(A) denote the number of 


boxes of side length (1/2") which intersect the attractor. If 
= im | ala”) 
In(2") J’ 


then A has fractal dimension D. 


Proof We observe that for m = 1, 2,3,..., 
2-™N,_1 < N(A, 1/2") < Ney for alln = 1,2, 3,..., 


where k(n) is the smallest integer k satisfying k >n—1+ + log, m. The first in- 
equality holds because a ball of radius 1/2” can intersect at most 2” “on-grid” boxes 
of side 1/2”—!. The second follows from the fact that a box of side s can fit inside a 
ball of radius r provided r? > ($)* + (§)? + cldots + ($)? = m($)? by the theorem 
of Pythagoras. Now 


ee [=o ie ["— no 
In(2”) n>oo | In(2”) In(2*@) 


== 3 


n—>0o 


since ae — 1. Since also 


noo | In(2") ~ noo f In(2"-!) Jo’ 


Theorem 1.1 with r = 1/2 completes the proof. 


There is nothing magical about using boxes of side (1/2)” in Theorem 1.2. One 
can equally well use boxes of side Cr”, where C > 0 and 0 <r < 1 are fixed real 
numbers. 
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A PGP NO I 
re wreriee EHH 


of side (1/2”) to cover 

mw Cc R?. We deduce, with 
a feeling of relief, that the 
fractal dimension of @ is 
2. To which collage is this 
related? 


Examples & Exercises 


1.4. Consider the m C R?. It is easy to see that V (@) = 4, N>(m) = 16, N3(@) = 64, 
N4(@) = 256, and in general that NV, (@) = 4” for n = 1, 2, 3, ...; see Figure V.134. 
Theorem 1.2 implies that 


D(®) = jim | 


ie) 


In(V, (@)) _ { In(4") 
———— ; = lim a 
In(2”) noo | In(2”) 
1.5. Consider the Sierpinski triangle A, in Figure V.135, as a subset of (R?, 
Euclidean). 

We see that N/\(A) = 3, N2(A) = 9, N3(A) = 27, N4(A) = 81, and in general 
N,,(A) = 3" for n = 1, 2,3,.... 

Theorem 1.2 implies that 


D(A) = lim 


noo 


| In(Nn(A) _ . [nGy 7 ane) 

In(2") J n>0o | In(2"){ ‘In(2)’ 

1.6. Use the Box Counting Theorem, but with boxes of side length (1/3), to cal- 
culate the fractal dimension of the classical Cantor set C described in exercise 1.5 in 
Chapter III. 


1.7. Use the Box Counting Theorem to estimate the fractal dimension of the fractal 
subset of R? shown in Figure V.136. You will need to take as your first box the 
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a erp pe pe pe Bee ee Pee Pepe pepe || Sloure V.1SS. It re- 


quires 3” closed boxes 
ei 3a “ail se | i: ae of side (1/2)" to cover 
: = ‘all the Sierpinski triangle 

A C R?. We deduce that 


its fractal dimension is 
In(3)/ In(2). 


obvious one suggested by the figure. You should then find that there appears to be 
a pattern to the sequence of numbers Nj, N2, N3, .... 


1.8. The same problem as 1.7, this time applied to Figure V.137. By making the 
right choice of Cartesian coordinate system, you will make this problem easy. 


What happens to the fractal dimension of a set if we deform it “with bounded 
distortion”? The following theorem tells us that metrically equivalent sets have the 
same fractal dimension. For example, the two fractals in Figure V.131 have the same 
fractal dimension! 

Theorem 1.3 Let the metric spaces (X,, d,) and (X2, dz) be metrically equiv- 
alent. Let 0: X; — X> be a transformation that provides the equivalence of the 
spaces. Let Ay € H(X,) have fractal dimension D. Then Az = 8(A}) has fractal di- 
mension D. That is 


D(A) = D(@(A2)). 


Proof This proof makes use of the concepts of the lim sup and lim inf of a func- 
tion. (The lim sup is discussed briefly following Definition 2.1 in the next section.) 

Since the two spaces (Xj, d,) and (X>, d2) are equivalent under 6, there exist 
positive constants e; and e2 such that 


€142(0(x), A(y)) < d(x, y) < e2d2(A(x), O(y)) forallx,yeX);. (3) 
Without loss of generality we assume that e; < 1 < e2. Equation 3 implies 


d(x, 
d>(0(x), 0(y)) < owe for all x, y € X1. 
1 
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Figure V.136. What 
other well-known frac- 
tal has the same fractal 
dimension? 


This implies 

6( B(x, €)) C B(O(x), €/e;)) forall x € X;. (4) 
Now, from the definition of \/(A;,€), we know that there is a set of points 
{x1, X2,...,xXyv} C X1, where NV = AN(Aj,€), such that the set of closed balls 
{B(xn, €):n=1,2,...,.N(A1, €)} provides a cover of Aj. It follows that {9(B(x,, 
€)):n=1,2,...,N(A}, €)} provides a cover of Az. Equation 4 now implies that 
{B(O (xn), €/e1)) :n = 1, 2,..., N(Aj, €)} provides a cover of Az. Hence 

N(A2, €/e;) ne N(A, €). 

Hence, when € < 1, 


In(N(A2, €/e1)) _ In(W(Ai, €)) 


In(1/e) ~~ In(1/e) 
It follows that 
In(N (Ap, €)) 
pd cc . 
SS In(N (Ap, €/e1)) _ {InW(Aj, €)) = 
sora) aU i 


We now seek an inequality in the opposite direction. Equation 3 implies that 
d,(9~'(x), 07"(y)) < erdo(x, y) for all x, y € Xp. 
This tells us that 
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Figure V.137. If you 
choose the “first” box just 


ber NV, of boxes of side 
1/2” which intersect the 
an, jor n= 1,2;35,.-. 


ae 


| Ci % right, the fractal dimen- 

VWDip sion of this fractal is easily 

| estimated. Count the num- 
eth < and apply the Box Count- 


be Mc a. v . ing Theorem. 
i, 5 


yy 
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6-1(B(x, €)) C B(O~!(x), er€)) for all x € Xo, 
and this in turn implies 
N (Aj, €2€) < N (Az, €). 


Hence, when € < l, 


In(N(A, €2€)) - In(N (Az, €)) 


In(i/e) ~ — In(1/e) 
It follows that 
In(V(A1, 
DA = in | 
_ ge J MW C1 22€)) 
ae 7 
_._. { In(N(Az, €)) 
Se 


By combining equations 6 and 7 we obtain 


__. f InW(A2,€)) | _ fat In(N'(Ap, €)) 
is mo 17 (A) = Fim ap a in(1/e) } 
From this it follows that 
4 finW(A2, 6) 
Deda) = i ig fA 


This completes the proof. 
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Examples & Exercises 


1.9. Let C denote the classical Cantor set, living in [0, 1] and obtained by omitting 
“middle thirds.” Let C denote the Cantor set obtained by starting from the closed 
interval [0, 3] and omitting “middle thirds.” Use Theorem 1.3 to show that they 
have the same fractal dimension. Verify the conclusion by means of a box-counting 
argument. 


1.10. Let A be a compact nonempty subset of R?. Suppose that A has fractal di- 
mension D, when evaluated using the Euclidean metric and fractal dimension D> 
when evaluated using the Manhattan metric. Show that D,; = Dp. 


1.11. This example takes place in the metric space (R*, Manhattan). Let A; and A> 
denote the attractors of the following two hyperbolic IFS 


{R?; wi(x, y), we(x, y), w(x, y)} and = {IR?; wa(x, y), ws(x, y), welx, y)}, 


“()=( )G)+() 
“(= )G): 
“()-G )C)+() 
“G)=G G)+() 
“G)=G YG) 
“(= )G)+() 


By finding a suitable change of coordinates, show A, and A> have the same fractal 
dimensions. 


2 The Theoretical Determination of the Fractal Dimension 


The following definition extends Definition 1.1. It provides a value for the fractal 
dimension for a wider collection of sets. 


Definition 2.1 Let (X,d) be a complete metric space. Let A € H(X). Let N(e) 
denote the minimum number of balls of radius € needed to cover A. If 


a InN(é) - 
D= lim {sup {PS :2€ 0.6 | 
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exists, then D is called the fractal dimension of A. We will also use the notation 
D = D(A), and will say “A has fractal dimension D.” 


In stating this definition we have “spelled out” the lim sup. For any function f(e), 
defined for 0 < € < 1, for example, we have 


lim sup f(€) = lim {sup{ f (€) :€ € (0, €)}}. 
«0 = 


It can be proved that Definition 2.1 is consistent with Definition 1.1: if a set has 
fractal dimension D according to Definition 1.1 then it has the same dimension 
according to Definition 2.1. Also, all of the theorems in this book apply with either 
definition. The broader definition provides a fractal dimension in some cases where 
the previous definition makes no assertion. 


Theorem 2.1 Let m be a positive integer; and consider the metric space (R", 
Euclidean). The fractal dimension D(A) exists for all A € H(R”). Let B € H(R”) 
be such that A C B; and let D(B) denote the fractal dimension of B. Then D(A) < 
D(B). In particular, 


0 < D(A) <m. 


Proof We prove the theorem for the case m = 2. Without loss of generality we 
can suppose that A Cm. It follows that (A, €) < A (@, €) for all « > 0. Hence for 
all € such that 0 < € < 1 we have 


i In(NV(A, €)) Z In(Vm, €)) 


= Ip€i fey ICT) 
It follows that 
lim su aoreres < lim sup feeeree : 
e—>0 In(1/e) at In(1/e) 


The lim sup on the right-hand side exists and has value 2. It follows that the lim sup 
on the left-hand side exists and is bounded above by 2. Hence the fractal dimension 
D(A) is defined and bounded above by 2. Also D(A) is nonnegative. 

If A, B € H(R*) with A Cc B, then the fractal dimensions of A and B are defined. 
The above argument wherein m is replaced by B shows that D(A) < D(B). This 
completes the proof. 


The following theorem helps us to calculate the fractal dimension of the union of 
two sets. 


Theorem 2.2 Let m be a positive integer; and consider the metric space (R", 
Euclidean). Let A and B belong to H(R”™). Let A be such that its fractal dimension 
is given by 
In(NV (A, €)) 


DAs im | in(1/e) 
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Let D(B) and D(A U B) denote the fractal dimensions of B and A U B, respectively. 
Suppose that D(B) < D(A). Then 


D(A U B) = D(A). 


Proof From Theorem 2.1 it follows that D(A U B) > D(A). We want to show 
that D(A U B) < D(A). We begin by observing that, for all « > 0, 


N(AUB,€) <N(A,€)+N(B, €). 


It follows that 


D(A U B) = lim sup 


e—0 


In(NV(A U B, €)) 
In(1/e) 
In (A, €) + V(B, €)) 
In(1/e) 
, In(V(A, €)) {i +N(B, €)/N(A, €)) 
< lm.sup { —-— oa REE a aes ee 
<0 In(1/e) In(1/e) 

The proof is completed by showing that V’(B, €)/N(A, €) is less than 1 when € is 
sufficiently small. This would imply that the second limit on the right here is equal 
to zero. The first limit on the right converges to D(A). 


< lim sup 
e—0 


+ lim sup 
e—0 


Notice that 
In(V(B,€)) _. 
———___—— :€ <€ 
In(1/e) 
is a decreasing function of the positive variable ¢. It follows that 


In(V (B, €)) 

In(1/e) 
Because the limit explicitly stated in the theorem exists, it follows that 
In(NV (B, €)) . In (A, €)) 

In(1/e) In(1/eé) 

This allows us to conclude that 

N(B,€) 
N(A, €) 

This completes the proof. 


< D(A) for all sufficiently small € > 0. 


for all sufficiently small € > 0. 


< 1 for all sufficiently small € > 0. 


Examples & Exercises 


2.1. The fractal dimension of the hairy set A C R?, suggested in Figure V.138, is 2. 
The contribution from the hairs to (A, €) becomes exponentially small compared 
to the contribution from m, as « —> 0. 


We now give you a wonderful theorem that provides the fractal dimension of the 
attractor of an important class of IFS. It will allow you to estimate fractal dimensions 
“on the fly,” simply from inspection of pictures of fractals, once you get used to it. 
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Figure V.138. Picture 
\ of a hairy box. The fractal 
dimension of the subset 
of R* suggested here is 
the same as the fractal 
dimension of the box. The 
hairs are overpowered. 


e 


Theorem 2.3 Let {R”; wi, w2,..., wy} be a hyperbolic IFS, and let A de- 
note its attractor. Suppose wy, is a similitude of scaling factor s, for each n € 
{1,2,3,..., N}. If the IFS is totally disconnected or just-touching then the attractor 
has fractal dimension D(A), which is given by the unique solution of 


N 
> Isl? =1, D(A) € [0, m]. 
n=l 


If the IFS is overlapping, then D > D(A), where D is the solution of 
N see 
> ela Pete). 
n=1 
Sketch of proof The full proof can be found in [Bedford 1986], [Hardin 1985], 
(Hutchinson 1981], and [Reuter 1987]. The following argument gives a valuable 
insight into the fractal dimension. We restrict attention to the case where the IFS 
{R”; wi, w2,..., wy} is totally disconnected. We suppose that the scaling factor s; 
associated with the similitude w; is nonzero for each i € {1,2,..., N}. Let e > 0. 
We begin by making two observations. 
Observation (i): Let i € {1,2,..., N}. Since w; is a similitude of scaling factor 
s;, it maps closed balls onto closed balls, according to 


w; (B(x, €)) = B(w;(x), |sil€). 
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Assume that s; 4 0 . Then w; is invertible, and obtain 

w; (B(x, €)) = B(w; '(x), |si|7"e). 
The latter two relations allow us to establish that for all e« > 0, 

N(A, €) =N(w;(A), |sile); 
which is equivalent to 
N(w;(A), €) = N(A, |si|7"e). (1) 
This applies for eachi € {1,2,3,..., N}. 
Observation (ii): The attractor A of the IFS is the disjoint union 
A = w (A) U w2(A) U...U wy (A), 


where each of the sets w,,(A) is compact. Hence we can choose the positive number 
€ so small that if, for some point x € R* and some integer i € {1, 2,..., N}, we have 
B(x, €) 1 w;(A) £@, then B(x, €) w,(A) = @ forall j € {1,2,..., N} with j 4. 
It follows that if the number € is sufficiently small we have 


N(A, €) =N(w1(A), €) + N(w2(A), €) +N (w3(A), €) +++» + N(ww(A), €) 


We put our two observations together. Substitute from equation 1 into the last equa- 
tion to obtain 
N(A, €) =N(A, |s1|~'€) + N(A, |52|7"€) 
+N(A, |s3|~"€) +--+ M(A, Iswl"e). 

This functional equation is true for all positive numbers € that are sufficiently small. 
The proof is completed by showing formally that this implies the assertion in the 
theorem. 

Here we demonstrate the reasonableness of the last step. Let us make the assump- 
tion V(A, €) ~ Ce~? . Then substituting into equation 2 we obtain the equation: 


(2) 


Ce“? & Cls,|Pe~? + Cls2|Pe~? + Cls3|2e 9 +--+ Clsy|Pe?. 
From this we deduce that 
1 = |s|? + |s2|? + |s3]? + --- + [sal?. 


This completes our sketch of the proof of Theorem 2.3. 


Examples & Exercises 


2.2. This example takes place in the metric space (R*, Euclidean). A Sierpinski 
triangle is the attractor of a just-touching IFS of three similitudes, each with scaling 
factor 0.5. Hence the fractal dimension is the solution D of the equation 


(0.5)? + (0.5)? + (0.5)? = 1 
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Figure V.139. The 
Castle fractal. This is 
an example of a self- 
similar fractal, and its 
fractal dimension may 
be calculated with the 
aid of Theorem 2.3. The 
associated IFS code is 
given in Table V.1. 


from which we find 


_ In(1/3) _ In@) 
~ 0.5) in. 


2.3. Find a just-touching IFS of similitudes in R? whose attractor is m. Verify that 
Theorem 2.3 yields the correct value for the fractal dimension of @. 


2.4. The classical Cantor set is the attractor of the hyperbolic IFS 


1 l 2 
{(0, 1]; wi(x) = 37 w1(x) = ae + 3): 


Use Theorem 2.3 to calculate its fractal dimension. 


2.5. The attractor of a just-touching hyperbolic IFS {R?; w;(x),i = 1, 2, 3, 4} is 
represented in Figure V.139. The affine transformations w; : R? > R? are simili- 
tudes and are given in tabular form in Table V.1. Use Theorem 2.3 to calculate the 
fractal dimension of the attractor. 


2.6. The attractor of a just-touching hyperbolic IFS {R*; w;(x), i = 1, 2, 3} is rep- 
resented in Figure V.140. The affine transformations w; : R* > R? are similitudes. 
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Table V.1. IFS code for a Castle. 


w 20s bere shoe =f p 

| Be. 8.0. 06..0 0 025 
2 Of... 85...7 0.2005 
3: B40 :0.:04 0 4: 50 
5505-028) OS 2 1-2-0395 


Figure V.140. To cal- 
culate the fractal dimen- 
sion of the subset of R* 
represented here, first ap- 
ply the Collage Theorem 
to find a corresponding set 
of similitudes. Then use 
Theorem 2.3. 


X XMaKX I 
ay [e, 


Use the Collage Theorem to find the similitudes, and then use Theorem 2.3 to calcu- 
late the fractal dimension of the attractor. 


2.7. Figure V.141 represents the attractor of an overlapping hyperbolic IFS 
{R?; w;(x), i = 1, 2, 3, 4}. 


Use the Collage Theorem and Theorem 2.3 to obtain an upper bound to the fractal 
dimension of the attractor. 


2.8. Calculate the fractal dimension of the subset of R? represented by Figure V.142. 
2.9. Consider the attractor A of a totally disconnected hyperbolic IFS 
{R’; w;(x), i = 1, 2} 
where the two maps 
w,:R’>R’ and w,:R’ > R’ 
are similitudes, of scaling factors s; and s2, respectively. Show that A is also the 
attractor of the totally disconnected hyperbolic IFS {R’; v;(x), i = 1, 2, 3, 4} where 


Vj = UW] O Wj, V2 = W O W2, V3 = W2 0 W), and v4 = wW2 O W2. Show that v;(x) is a 
similitude, and find its scaling factor, for i = 1, 2,3, 4. Now apply Theorem 2.3 to 
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Figure V.141. An up- 
per bound to the fractal 
dimension of the attractor 
of an overlapping IFS, cor- 
responding to this picture, 
can be computed with the 
aid of Theorem 2.3. 


A Figure V.142. Calcu- 
IMGN late the fractal dimension 
of the subset of R* repre- 

sented by this image. 
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yield two apparently different equations for the fractal dimension of A. Prove that 
these two equations have the same solution. 


3 The Experimental Determination of the Fractal Dimension 


In this section we consider the experimental determination of the fractal dimension 
of sets in the physical world. We model them, as best we can, as subsets of (R?, 
Euclidean) or (IR*, Euclidean). Then, based on the definition of the fractal dimen- 
sion, and sometimes in addition on one or another of the preceding theorems, such 
as the Box Counting Theorem, we analyze the model to provide a fractal dimension 
for the real-world set. 

In the following examples we emphasize that when the fractal dimension of a 
physical set is quoted, some indication of how it was calculated must also be pro- 
vided. There is not yet a broadly accepted unique way of associating a fractal di- 
mension with a set of experimental data. 


Example 


3.1. There is a curious cloud of dots in the woodcut in Figure V.143. Let us try to 
estimate its fractal dimension by direct appeal to Definition 1.1. 

We begin by covering the cloud of points by disks of radius « for a range of €- 
values from € = 3 cm down to € = 0.3 cm; and in each case we count the number of 


Figure V.143. Cover- 
ing a cloud of dots in a 
woodcut by balls of radius 
e>0. 
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Table V.2. Minimal numbers of balls, of various radii, needed to cover 
a “dust” in a woodcut. 


€ N (A, €) 
3 cm 2 
2cm 3 
1.5 cm 4 
1.2cm 6 
1 cm 7 
0.75 cm 10 
0.5 cm 16 
0.4 cm 23 
0.3 cm 31 
0.015 cm 267 


Table V.3. The data in Table III.1 is tabulated in log—log form. These values 
are used to obtain the fractal dimension. 


In(i/e) In(N(A,€)) 


-1.1 0.69 
-0.69 1.09 
-0.405 1.39 
-0.182 1,79 
0 1.95 
0.29 2.30 
0.693 eae | 
0.916 3.13 
1.204 3.43 
4.2 J.39 


disks needed. This provides the set of approximate values for (A, €) given in Table 
III.1. The data is redisplayed in log—log format in Table V.3. The data in Table V.3 is 
plotted in Figure V.144. A straight line that approximately passes through the points 
is drawn. The slope of this straight line is our approximation to the fractal dimension 
of the cloud of points. 

The experimental number (A, 0.015 cm) is not very accurate. It is a very rough 
estimate based on the size of the dots themselves and is not included in the plot in 
Figure V.144. The slope of the straight line in Figure V.144 gives 


D(A) = 1.2, over the range 0.3 cm to 3cm, (1) 


where A denotes the set of points whose dimension we are approximating. 
The straight line in Figure V.144 was drawn “by eye.” Thus if one was to repeat 
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Figure V.144. Log- ee ee ee ee 
log plot to estimate the 

fractal dimension D for 

the cloud of dots in the 

woodcut in Figure V.143. 

The data is in Tables [II.1 Log( N(A. E) ) 


0.69 
11 Log(1/E) 4.2 


the experiment, a different value for D(A) may be obtained. In order to make the re- 
sults consistent from experiment to experiment, the straight line should be estimated 
by a least squares method. 

In proceeding by direct appeal to Definition 1.1, the estimates of N (A, €) need 
to be made very carefully. One needs to be quite sure that N (A, €) is indeed the 
least number of balls of radius € needed. For large sets of data this could be very 
time-consuming. 

It is clearly important to state the range of scales used: we have no idea or defi- 
nition concerning the structure of the dots in Figure V.143 at higher resolutions than 
say 0.015 cm. Moreover, regardless of how much experimental data we have, and 
regardless of how many scales of observation are available to us, we will always end 
up estimating the slope of a straight line corresponding to a finite range of scales. If 
we include the data point (0.015 cm, 267) in the above estimation, we obtain 


D(A) ~ 0.9, over the range of scales 0.015 to 5cm. (2) 


We comment on the difference between the estimates in equations 1 and 2. If we 
restrict ourselves to the range of scales in equation 1, there little information present 
in the data to distinguish the cloud of points from a very irregular curve. However, 
the data used to obtain equation 2 contains values for N (A, €) for several values of 
€ such that the corresponding coverings of A are disconnected. The data is “aware” 
that A is disconnected. This lowers the experimentally determined value of D. 


3.2. In this example we consider the physical set labelled A in Figure V.145. A is 
actually an approximation to a classical Cantor set. In this case we make an exper- 
imental estimate of the fractal dimension, based on the Box Counting Theorem. A 
Cartesian coordinate system is set up as shown and we attempt to count the num- 
ber of square boxes N,,(A) of side (1/2”) which intersect A. We are able to obtain 
fairly accurate values of N,(A) for n = 0, 1, 2, 3, 4,5, and 6. These values are pre- 
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Pe -CHECH CBee MOUSE: Val AD - Sngnre: 
: sive subdivision of over- 
laying grid to obtain the 
box counts needed for 
the application of Theo- 
rem 1.2 to estimate the 
fractal dimension of the 
Cantor set A. The counts 
are presented in Table V.4. 


2 
S a 
3 ean a 
scan Oe gn in 
RH PEE 


0 


Table V.4. The data determined from Figure 145, in the experimental calculation 
of the fractal dimension of the physical set A. 


n N,(A) InN,(A) nin2 


0 1 0 0 

I 3 110 0.69 
BOOUTOp oe 18 Bae gay 
392 10 2.30 2.08 
4) smt9 29403 277 
Siiste139 3.50 3.46 
6 658 4.06 4.16 


sented in Table V.4. We note that these values depend on the choice of coordinate 
system. Nonetheless the values of \,,(A) are much easier to measure than the values 
of N’(A, €) used in example 3.1. 

The analysis of the data proceeds just as in example 3.1. It is represented in 
Table V.4 and Figure V.146. We obtain 


1 
D(A) ~ 0.8, over the range 3 inch to 8 inches. 


3.3. In this example we show how a good experimentalist [Strahle 1987] overcomes 
the inherent difficulties with the experimental determination of fractal dimensions. 
In so doing he obtains a major scientific result. The idea is to compare two sets of 
experimental data, obtained by different means, on the same physical system. The 
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Figure V.146. Slope 5 
of the plot of the data 

in Table V.4 gives an 

approximation to the 

fractal dimension of the 


set A in Figure V.145. 
: Log (Np) 


n Log (2) ——_—-> 


physical system is a laboratory jet flame. The data are time series for the temperature 
and velocity at two different points in the jet. The idea is to apply the same procedure 
to the analysis of the two sets of data to obtain a value for the fractal dimension. 
The two values are the same. Instead of drawing the conclusion that the two sets of 
data “have the same fractal dimension,” he deduces that the two sets of data have a 
common source. That common source is physical, real-world chaos. 

The experimental setup is as follows. A flame is probed by (a) a laser beam and 
(b) a very thin wire. These two probes, coupled with appropriate measuring devices, 
allow measurements to be made of the temperature and velocity in the jet, at two 
different points, as a function of time. In (a) the light bounces off the fast-moving 
molecules in the exhaust, and a receiver measures the characteristics of the bounced 
light. The output from the receiver is a voltage. This voltage, suitably rescaled, gives 
the temperature of the jet as a function of time. In (b) a constant temperature is 
maintained through a wire in the flame. The voltage required to hold the temperature 
constant is recorded. This voltage, suitably rescaled, gives the velocity of the jet as 
a function of time. In this way we obtain two independent readings of two different, 
but related, quantities. 

Of course the experimental apparatus is much more sophisticated than it sounds 
from the above description. What is important is that the measuring devices are of 
very high resolution, accuracy, and sensitivity. A reading of the velocity can be made 
once every microsecond. In this example the temperature was read every 0.5 x 107+ 
sec. Vast amounts of data can be obtained. A sample of the experimental output from 
(a) is shown in Figure V.147, where it is represented as the graph of voltage against 
time. It is a very complex curve. If one “magnifies up” the curve, one finds that its 
geometrical complexity in the curve continues to be present. It is just the sort of thing 
we fractal geometers like to analyze. 

A sample of the experimental output from (b) is shown in Figure V.148, again 
represented as a graph of voltage against time. You should compare Figures V.147 
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RAYLEIGH SCATTERING VOLTAGE Figure V.147. Graph 
of voltage as a function of 
time from an experimental 
probe of a turbulent jet. 
In this case the probe 
measures scattering of a 
laser beam by the flame. 


HOT FILM VOLTAGE Figure V.148. Graph 
time from an experimental 
: Expanded Area | probe of a turbulent jet. In 

| 
| i 2 sures the voltage across 
( | ; a wire in the flame. This 
\ | 
arm Th Wn) 
| | IH it , iil | | character, as demonstrated 
| pre WT | IF by the expanded piece 


of voltage as a function of 
a a | : | | | this case the probe mea- 
CT CPU mT Dt | | 
i NA | | | I " | i | | : i 1. | | data has a definite fractal 
AOL A LTV 
shown in Figure V.149. 


! 
| 
iy! 
lj} % 
* 
HHA 


and V.148. They look different. Is there a relationship between them? There should 
be: they both probe the same burning gas and they are in the same units. 

In order to bring out the fractal character in the data, an expanded piece of the 
data in Figure V.148 is shown in Figure V.149. 

The fractal dimensions of the graphs of the two time series, obtained from (a) 
and (b), is calculated using a method based on the Box Counting Theorem. Exactly 
the same method is applied to both sets of data, over the same range of scales. Fig- 
ure V.150 shows the graphical analysis of the resulting box counts. Both experiments 
yield the same value 


D = 1.5, over the range of scales 2° x 10~> sec to 2!3 x 1079 sec. 


This suggests that, despite the different appearances of their graphs, there is a com- 
mon source for the data. 

We believe that this common source is chaotic dynamics of a certain special flavor 
and character, present in the jet flame. If so then fractal dimension provides an 
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experimentally measurable parameter that can be used to characterize the brand of 
Choas. | 


3.4. Use a method based on the direct application of Definition 1.1 to make an 
experimental determination of the fractal dimension of the physical set defined by 
the black ink in Figure V.151. Give the range of scales to which your result applies. 


3.5. Use a method based on the Box Counting Theorem, as in Example 3.2, to es- 
timate the fractal dimension of the “random dendrite” given in Figure V.152. State 
the range of scales over which your estimate applies. Make several complete experi- 
ments to obtain some idea of the accuracy of your result. 


3.6. Make an experimental estimate of the fractal dimension of the dendrite shown 
in Figure V.153. Note that a grid of boxes of size (1/12)th inch by (1/12)th inch has 
been printed on top of the dendrite. Compare the result you obtain with the result 
of exercise 3.5. It is important that you follow exactly the same procedure in both 
experiments. 


3.7. Make an experimental determination of the fractal dimension of the set in 
Figure V.142. Compare your result with a theoretical estimate based on Theorem 2.3, 
as in exercise 2.7. 


3.8. Obtain maps of Great Britain of various sizes. Make an experimental determi- 
nation of the fractal dimension of the coastline, over as wide a range of scales as 
possible. 


3.9. Obtain data showing the variations of a Stock Market index, at several different 
time scales, for example, hourly, daily, monthly, and yearly. Make an experimental 
determination of the fractal dimension. Find a second economic indicator for the 
same system and analyze its fractal dimension. Compare the results. 


Figure V.149. A EXPANDED AREA HOT FILM VOLTAGE 
blowup of a piece of the 
graph in Figure V.148. 
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3 - Figure V.150. Graphi- 
ete ue cal analysis of box counts 
! associated with experi- 
ments (a) and (b). The 
data analyzed is illus- 
trated in Figures V.145 
and V.148. The two data 
streams, which come from 
probes of a single turbu- 
lent system, when ana- 
lyzed in exactly the same 
way, yield the same value 
D = 1.5 for the fractal di- 
mension. This suggests 
@ Wt £ 10 that, despite the differ- 
ent appearances of their 
graphs, there is a com- 
mon source for the data. 
This source is chaotic dy- 
namics of a certain special 
flavor and character. The 
fractal dimension provides 
a measurable symptom of 
the brand of chaos. 


Hot Filw 
LOG N 


Slope = -1.5 


4 The Hausdorff-Besicovitch Fractal Dimension 


The Hausdorff-Besicovitch dimension of bounded subset of R” is another real num- 
ber that can be used to characterize the geometrical complexity of bounded subsets 
of R”. Its definition is more complex and subtle than that of the fractal dimension. 
One of the reasons for its importance is that it is associated with a method for com- 
paring the “sizes” of sets whose fractal dimensions are the same. It is harder to work 
with than the fractal dimension, and its definition is not usually used as the basis of 
experimental procedures for the determination of fractal dimensions of physical sets. 
Throughout we work in the metric space (R”, d) where m is a positive integer and 
d denotes the Euclidean metric. Let A C R” be bounded. Then we use the notation 


diam(A) = sup{d(x, y): x, y € A}. 


Let 0 < € < 00, and 0 < p < oo. Let A denote the set of sequences of subsets 
{A; C A}, such that A = U2, A;. Then we define 
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Figure V.151. Make 
an experimental estimate 
of the fractal dimension of 
the set A of black ink, 
above, over the range 

of scales 5 inches to 

0.1 inches. Base your 
experimental method 
directly on Definition 1.1. 
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(b) 


Figure V.152. Make an experimental estimate of the fractal dimension of this set in (a), 

over the range of scales 5 inches to (1/12)th inch, basing your method on the Box Counting 
Theorem and graphical analysis. In order to help you with your work, in (b) we have overlaid 
the set a grid of boxes (1/12)th inch by (1/12)th inch. 


SQUSSRSaSSuacSESaSGTSEeer SEESKSCRSECUSESREGSSeEeusesEaNooen Figure V.153. Make 


an experimental estimate 
of the fractal dimension 
of the random dendrite 
shown here. Note that 

a grid of boxes of size 
(1/12)th inch by (1/12)th 
inch has been printed 

on top of the dendrite. 
Compare the experimental 
fractal dimension here 
with that of the dendrite in 
Figure V.152. In advance, 
which one do you expect 
will have lower fractal 
dimension? 


aa & & ; ges 
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M(A, p,€) = 


oo 
inf {mca :{A;} € A, and diam(A;) < € fori = 1 2,3... : : 
j=] 


Here we use the convention that (diam(A;))° = 0 when A; is empty. M(A, p, €) is 
a number in the range [0, oo]; its value may be zero, finite, or infinite. You should 
verify that it is a nonincreasing function of €. We now define 


M(A, p) =sup{M(A, p, €) : € > O}. 
Then for each p € [0, 00] we have M(A, p) € [0, oo]. 
Definition 4.1 Let m be a positive integer and let A be a bounded subset of the 


metric space (IR, Euclidean). For each p € [0, 00) the quantity M(A, p) described 
above is called the Hausdorff p-dimensional measure of A. 


Examples & Exercises 
4.1. Show that M(A, p) is a nonincreasing function of p € [0, oo]. 


4.2. Let A denote a set of seven distinct points in (R’, Euclidean). Show that 
M(A, 0) =7 and M(A, p) = 0 for p > 0. 


4.3. Let A denote a countable infinite set of distinct points in (R*, Euclidean). 
Show that M(A, 0) = oo and M(A, p) =0 for p > 0 


4.4. Let C denote the classical Cantor set in [0,1]. Show that M(C, 0) =o and 
M(C, 1) = 0. 

4.5. Let A denote a convenient Sierpinski triangle. Show that M(A, 1) = 00 and 
M(A, 2) = 0. Can you evaluate M(A, In(3)/1n(2))? At least try to argue why this 
might be an interesting number. 


The Hausdorff p-dimensional measure M(A, p), as a function of p € [0, oo], 
behaves in a remarkable manner. Its range consists of only one, two, or three values! 
The possible values are zero, a finite number, and infinity. In Figure V.154 we 
illustrate this behavior when A is a certain Sierpinski triangle. 


Theorem 4.1 Let m be a positive integer. Let A be a bounded subset of the met- 
ric space (IR”, Euclidean). Let M(A, p) denote the function of p € [0, 00) defined 
above. Then there is a unique real number Dy € [0, m] such that 


(oe) if p < Dy and p€ [0, 00), 


M(A. p) =| if p > Dy and p € [0, 00). 


Proof This can be found, for example, in [Federer 1969], section 2.10.3. 


Definition 4.2 Let m be a positive integer and let A be a bounded subset of 
the metric space (R”, Euclidean). The corresponding real number Dy, occurring 
in Theorem 4.1, is called the Hausdorff-Besicovitch dimension of the set A. This 
number will also be denoted by Dy (A). 
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INFINITY Figure V.154. Graph 
the function M(A, p) 
when A is a certain Sier- 
pinski triangle. It takes 
only three values. 


M(p) 


0 9 neni 1.58 2 


Theorem 4.2 Let m be a positive integer and let A be a subset of the metric 
space (R™, Euclidean). Let D(A) denote the fractal dimension of A and let Dy(A) 
denote the Hausdorff-Besicovitch dimension of A. Then 


0 < Dy(A) < D(A) sm. 


Examples & Exercises 
4.6. Describe a situation where you would expect Dy(A) < D(A). 
4.7. Prove Theorem 4.2. 


Theorem 4.3 Let m be a positive integer. Let {R”; wi, w2,..., Wn} be a hy- 
perbolic IFS, and let A denote its attractor. Let Wn be a similitude of scaling factor 
sn for each n € {1,2,3,..., N}. Uf the IFS is totally disconnected or just-touching, 
then the Hausdorff-Besicovitch dimension Dy(A) and the fractal dimension D(A) 
are equal. In fact D(A) = Dy(A) = D, where D is the unique solution of 


N 
ys snl? = 1, D € (0, m]. 
n=1 


If D is positive, then the Hausdorff D-dimensional measure M(A, Dy(A)) ts a 
positive real number. 


Proof This can be found in [Hutchinson 1981]. 
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In the situation referred to in Theorem 4.3 the Hausdorff D;,(A)-dimensional 
measure can be used to compare the “sizes” of fractals that have the same frac- 
tional dimension. The larger the value of M(A, Dy(A)), the “larger” the fractal. 
Of course, if two fractals have different fractal dimensions, then we say that the one 
with the higher fractal dimension is the “larger” one. 


Examples & Exercises 


4.8. Here we provide some intuition about the functions M(A, p,€) and M(A, p), 
and the “sizes” of fractals. We illustrate how these quantities can be estimated. The 
type of procedure we use can often be followed for attractors of just-touching and 
totally disconnected IFS whose maps are all similitudes and should lead to correct 
values. Formal justification is tedious and follows the lines suggested in [Hutchinson 
1981]. 

Consider the Sierpinski triangle A with vertices at (0, 0), (0,1), and (1,0). 
We work in R? with the Euclidean metric. We begin by estimating the number 
M(A, p, €) for p € [0, 1] for various values of €. The values of € we consider are 
Ee= /2(1/2)" for n = 0, 1, 2,3,.... Now notice that A can be covered very effi- 
ciently by 3” closed disks of radius /2(1/2)". We guess that this covering is one for 
which the infinum in the definition of M(A, p, € = /2(1 /2)") is actually achieved. 
We obtain the estimate 


M(A, p, V2(1/2)") = 3"(V2)?(1/2)”” for n = 1, 2,3,.... 


The supremum in the definition of M(A, p) can be replaced by a limit; so we 
obtain 


M(A, p) = lim {3"(V2)?(1/2)"?} 


00 if p < In(3)/In(2), 
= { (/2)n@)/In@) if p = In(3)/ In(2), 
0 if p > In(3)/In(2). 


This tells us that Dy(A) = In(3)/1n(2), which we already know from Theo- 
rem 4.3. It also tells us that M(A, Dy(A)) = (/2)"@/™G)_ This is our estimate 
of the “size” of the particular Sierpinski triangle under consideration. 

If one repeats the above steps for the Sierpinski triangle A with vertices at 
(0, 0), (0, 1//2), and (1/./2, 0), one finds M(A, Dy(A)) = 1. Thus A is “small- 
er’ than A. Similar estimates can be made for pairs of attractors of totally discon- 
nected or just-touching IFS whose maps are similitudes and whose fractal dimen- 
sions are equal. The comparison of “sizes” becomes exciting when the two attractors 
are not metrically equivalent. 


4.9. Estimate the “sizes” of the two fractals represented in Figure V.155. Which one 
is “largest”? Does the computed estimate agree with your subjective feeling about 
which one is largest? 
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4.10. Prove that the Hausdorff-Besicovitch dimension of two metrically equivalent 
bounded subsets of (R”, Euclidean) is the same. 


4.11. Let d denote a metric on R* which is equivalent to the Euclidean metric. 
Let A denote a bounded subset of R*. Suppose that d is used in place of the Eu- 
clidean metric to calculate a “Hausdorff-Besicovitch” dimension of A, denoted by 
Dy(A). Prove that Dy(A) = Dy(A). Show, however, that the “size” of the set, 
M(A, Dy(A)), may be different when computed using d in place of the Euclidean 
metric. 


4.12. If distance in R2 is measured in inches, and a subset A of R? has fractal 
dimension 1.391, what are the units of M(A, 1.391)? 
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Figure V.156. Why 
are the fractal dimen- 
sion and the Hausdorff- 
Besicovitch dimension of 
the attractor of the IFS 
represented by this image 
equal? 


4.13. The image in Figure V.156 represents the attractor A of a certain hyperbolic 
IFS. 


(1) Explain, with support from appropriate theorems, why the fractal dimension 
D and the Hausdorff-Besicovitch dimension Dy of the attractor of the IFS 
are equal. 

(2) Evaluate D. 

(3) Using inches as the unit, compare the Hausdorff-Besicovitch D-dimensional 
measures of A and w(A), where w(A) denotes one of the small “first- 
generation” copies of A. 


4.14. By any means you like, estimate the Hausdorff-Besicovitch dimension of the 
coastline of Baron von Koch’s Island, shown in Figure V.157. It is recommended 
that theoreticians try to make an experimental estimate, and that experimentalists try 
to make a theoretical estimate. 


4.15. Does the work of some artists have a characteristic fractal dimension? Make 
a comparison of the empirical fractal dimensions of Romeo and Juliet, over an 
appropriate range of scales; see Figure V.158. 


The 
middle of 
Baron von Koch's 


Island is white to 
save ink 
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Figure V.157. By any 
means you like, estimate 
the Hausdorff-Besicovitch 
dimension of the coastline 
of Baron von Koch’s 
Island. 
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Figure V.158. Does EEE EEE EEE 
the work of some artists scene ssscuceea 

have a characteristic frac- 
tal dimension? Make a 
comparison of the empir- 
ical fractal dimensions of 
Romeo and Juliet, over 
an appropriate range of 
scales. 
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Chapter VI 
Fractal Interpolation 


1 Introduction: Applications for Fractal Functions 


Euclidean geometry, trigonometry, and calculus have taught us to think about mod- 
elling the shapes we see in the real world in terms of straight lines, circles, parabolas, 
and other simple curves. Consequences of this way of thinking are abundant in our 
everyday lives. They include the design of household objects; the common usage of 
drafting tables, straight-edges, and compasses; and the “applications” that accom- 
pany introductory calculus courses. We note in particular the provision of functions 
for drawing points, lines, polygons, and circles in computer graphics software such 
as MacPaint and Turbobasic. Most computer graphics hardware is designed specifi- 
cally to provide rapid computation and display of classical geometrical shapes. 

Euclidean geometry and elementary functions, such as sine, cosine, and poly- 
nomials, are the basis of the traditional method for analyzing experimental data. 
Consider an experiment that measures values of a real-valued function F(x) as a 
function of a real variable x. For example, F(x) may denote a voltage as a function 
of time, as in the experiments on the jet-engine exhaust described in Example 3.3 in 
Chapter V. The experiment may be a numerical experiment on a computer. In any 
case the result of the experiment will be a collection of data of the form: 


Ux Fie 0 1.2. 
Here N is a positive integer, F; = F(x;), and the x;’s are real numbers such that 
AQ Ss Al Ss ADS AB SS AN 


The traditional method for analyzing this data begins by representing it graphi- 
cally as a subset of R*. That is, the data points are plotted on graph paper. Next the 
graphical data is analyzed geometrically. For example, one may seek a straight line 
segment that is a good approximation to the graph of the data. Or else, one might 
construct a polynomial of as low degree as possible, whose graph is a good fit to 
the data over the interval [xo, xy]. In place of a polynomial, a linear combination of 
elementary functions might be used. The goal is always the same: to represent the 
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Figure VI.159. Illus- 
tration of the process 
whereby experimental 
data is represented graph- 
ically and modelled geo- 
metrically by means of a 
classical geometrical en- 
tity, such as a straight line 
or a polynomial fit to the 
data. 


A STRAIGHT LINE IS A 
EUCLIDEAN APPROXIMATION 
TO THE DATA 


Xo Xn 


THE GRAPH OF A POLYNOMIAL IS A 
EUCLIDEAN APPROXIMATION 
TO THE DATA 


Xo Xn 


data, viewed as a subset of R7, by a classical geometrical entity. This entity is repre- 
sented by a simple formula, one that can be communicated easily to someone else. 
The process is illustrated in Figure VI.159. 

Elementary functions, such as trigonometric functions and rational functions, 
have their roots in Euclidean geometry. They share the feature that when their graphs 
are “magnified” sufficiently, locally they “look like” straight lines. That is, the tan- 
gent line approximation can be used effectively in the vicinity of most points. More- 
over, the fractal dimension of the graphs of these functions is always 1. These el- 
ementary “Euclidean” functions are useful not only because of their geometrical 
content but because they can be expressed by simple formulas. We can use them 
to pass information easily from one person to another. They provide a common lan- 
guage for our scientific work. Moreover, elementary functions are used extensively 
in scientific computation, computer-aided design, and data analysis because they can 
be stored in small files and computed by fast algorithms. 

Graphics systems founded on traditional geometry are effective for making pic- 
tures of man-made objects, such as bricks, wheels, roads, buildings, and cogs. This 


1 Introduction: Applications for Fractal Functions 


is not suprising, since these objects were designed in the first place using Euclidean 
geometry. However, it is desirable for graphics systems to be able to deal with a 
wider range of problems. 

In this chapter we introduce fractal interpolation functions. The graphs of these 
functions can be used to approximate image components such as the profiles of 
mountain ranges, the tops of clouds, stalactite- hung roofs of caves, and horizons 
over forests, as illustrated in Figure VI.160. Rather than treating the image compo- 


nent as arising from a random assemblage of objects, such as individual mountains, 


cloudlets, stalactites, or tree tops, one models the image component as an interre- 
lated single system. Such components are not well described by elementary func- 
tions or Euclidean graphics functions. 

Fractal interpolation functions also provide a new means for fitting experimen- 
tal data. Clearly it does not suffice to make a polynomial “least-squares” fit to the 
wild experimental data of Strahle for the temperature in a jet exhaust as a function 
of time, as illustrated in Figure V.147. Nor would classical geometry be a good tool 
for the analysis of voltages at a point in the human brain as read by an electoren- 
cephalograph. However, fractal interpolation functions can be used to “fit” such ex- 
perimental data: that is, the graph of the fractal interpolation function can be made 
close, in the Hausdorff metric, to the data. Moreover, one can ensure that the fractal 
dimension of the graph of the fractal interpolation function agrees with that of the 
data, over an appropriate range of scales. This idea is illustrated in Figure V1.161. 

Fractal interpolation functions share with elementary functions that they are of a 
geometrical character, that they can be represented succinctly by “formulas,” and 
that they can be computed rapidly. The main difference is their fractal character. 
For example, they can have a noninteger fractal dimension. They are easy to work 
with— once one is accustomed to working with sets rather than points and with 
IFS theory using affine maps. If we start to pass them from one to another, fractal 
functions will become part of the common language of science. So read on! 


Examples & Exercises 


1.1. Write an essay on the influences of Euclidean geometry on the way in which 
we view the physical world. How does fractal geometry change that view? 


1.2. Find the linear approximation /(x) to the function f(x) = sin(x), about the 
point x = 0. Let € > 0. Find the linear change of coordinates (x’, y’) = @(x, y) in 
R2, such that 6([0, €] x [0, €]) = [0, 1] x [0, 1]. Let Z’(x’) denote the function /(x) 
represented in the new coordinate system. Let f’(x’) denote the function f(x) in 
the new coordinate system. Let L denote the graph of /'(x’) for x’ € [0, 1] and let G 
denote the graph of f’(x’) for x’ € [0, 1]. How small must € be chosen to ensure that 
the Hausdorff distance from L to G is less than 0.01? The Hausdorff distance should 
be computed with respect to the Manhattan metric in R?. 
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Figure VI.160. The 
fractal interpolation func- 
tions introduced in this 
chapter may be used in 
computer graphics soft- 
ware packages to provide 
a simple means for render- 
ing profiles of mountain 
ranges, the tops of clouds, 
and horizons over forests. 


2 Fractal interpolation Functions 


Definition 2.1 A set of data is a set of points of the form {(x;, F;) € R?:i= 
0,1,2,..., N}, where 


AO AAS IDSA & i AN 


An interpolation function corresponding to this set of data is a continuous function 
f : (x0, xn] — R such that 
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Figure VI.161. Thi 
THE EXPERIMENTAL DATA den = . Be 
AND THE FRACTAL FUNCTION SURG ieee RRS SBE 
MIGHT “LOOK ALIKE” OVER of using a fractal inter- 
A RANGE OF SCALES. polation function to fit 


experimental data. The 
graph of the interpolation 
function may be close, in 
the Hausdorff metric, to 
the graph of the experi- 
mental data. The fractal 
dimension of the interpo- 
lation function may agree 
with that of the data over 
an appropriate range of 
scales. 


DATA POINTS LIE CLOSE TO 
THE GRAPH OF A FRACTAL 
INTERPOLATION FUNCTION 


7) =F; Oris i... 


The points (x;, F;) € R? are called the interpolation points. We say that the function 
f interpolates the data and that (the graph of) f passes through the interpolation 
points. 


Examples & Exercises 
2.1. The function f(x) = 1 +~ is an interpolation function for the set of data 


{(0, 1), (1, 2)}- 
Consider the hyperbolic IFS {R?; w;, w2}, where 


= (5) ° Ce xs) (;) is as) 
(2) =(3 os) (5)+("): 


Let G denote the attractor of the IFS. Then it is readily verified that G is the straight 


line segment that connects the pair of points (0, 1) and (1, 2). In other words, G is 
the graph of the interpolation function f(x) over the interval [0, 1]. 


2.2. Let {(x;, F;):i =0,1,2,..., N} denote a set of data. Let f : [xo, xy] > R 
denote the unique continuous function that passes through the interpolation points 
and is linear on each of the subintervals [x;_1, x;]. That is, 
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Figure VI.162. Graph 
of the piecewise linear 
interpolation function 

f (x) through the inter- 
polation points {(F;, x;) : 
i= 0, 1,2,3, 4}. This 
graph is also the attrac- 
tor of an IFS of the form 
{R?; w,,n = 1, 2, 3, 4}, 
where the maps are affine. 


(x — Xj-1) 
(Xj; — Xi-1) 
The function f(x) is called a piecewise linear interpolation function. The graph of 
f (x) is illustrated in Figure VI.162. This graph, G, is also the attractor of an IFS of 


iF; — Fj) for x € [x;--1, x], i = 1, 2,..., N. 


f() = Fi-i+ 


the form {R?; Wn, n =1,2,..., .N}, where the maps are affine. In fact, 
x a, O x en 
Ww = + : 
(5) s % = ES) 
where 
(Xn — Xn-1) (XnXn—-1 — X0Xn) 
- imeLy (ROME SIT SS ee 
(xn — Xo) (xn — Xo) 
F,, — F,,- F,-1 — xoF, 
i eed go Mi forn =1,2,..., N. 


(in — Ka) (xn — Xo) 

Notice that the IFS may not be hyperbolic with respect to the Euclidean metric in 
R?. Can you prove that, nonetheless, G is the unique nonempty compact subset of 
R? such that 


G = Un_Wn(G)? 


2.3. Verify the claims in exercise 2.2 in the case of the data set {(0, 0), (1, 3), (2, 0)} 
by applying either the Deterministic Algorithm, Program 1, Chapter III, or the Ran- 
dom Iteration Algorithm, Program 2 of Chapter III. You will need to modify the 
programs slightly. 


2.4. The parabola defined by f(x) = 2x — x? on the interval [0, 2] is an interpo- 
lation function for the set of data {(0, 0), (1, 1), (2, 0)} . Let G denote the graph of 
f (x). That is 


G = {(x, 2x — x”): x € [0, 2])}. 
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Then we claim that G is the attractor of the hyperbolic IFS {R?; w;, w2}, where 
a ~~ { 05....0 x 
Ly) (O05 O25) \y 7? 
wt eo 0.5 0 + . 1 
*\y}] (05 6325) ij. 


We verify this claim directly. We simply note that for all x € [0, 2], 


(sy) =(ago gue) = Gi) 
'\f@)) ~ (ads) — dx?) \ fe)’ 


wa ( . )=( L+ 5% )=( 1+3x ) 
2 f(x) ae 2(1 + $x) — (1+ 3x)? 8: f+ 4x) ; 


As x varies over [0, 2], the right-hand side of the first equation yields the part of the 
graph of f(x) lying over the interval [0, 1], while the right-hand side of the second 
equation yields the part of the graph of f(x) lying over the interval [1, 2]. Hence 
G = w,(G) U w,(G). Since G € H(IR*) we conclude that it is the attractor of the 
IFS. Notice that the IFS is just-touching. 


2.5. Find a hyperbolic IFS of the form {R*; w;, wo}, where w; and wy are affine 
transformations in R?, whose attractor is the graph of the quadratic function that 
interpolates the data {(0, 0), (1, 1), (2, 4)}. 


Let a set of data {(x;, F;):i =0,1,2,..., N} be given. We explain how one can 
construct an IFS in R? such that its attractor, which we denote by G, is the graph 
of a continuous function f : [xo, xv] — R, which interpolates the data. Throughout 
we will restrict our attention to affine transformations. The usage of more general 
transformations is discussed in [Barnsley 1988f]. 

We consider an IFS of the form {R?; w,,n =1,2,...,.N}, where the maps are 
affine transformations of the special structure 


m™()=(e aG)+(4): 


The transformations are constrained by the data according to 


Xo \_ { Xn-1 XN \_ [ Xn = 
wi? )—(* ) and w(\=(2) forn e152, 22 N. 


The situation is summarized in Figure VI.163. 
Let n € {1, 2,3, ..., N}. The transformation w, is specified by the five real num- 
bers an, Dn, Cn, Gn, and e,, which must obey the four linear equations 
AnX9 + €n =Xn-1; 
AnXN + €n = Xn, 
CnX0 + dn Fo + Sn = Fy-1, 
CnXN + d, Fn + th = 
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Fractal Interpolation 


Figure V1.163. Two il- / 
lustrations showing how 

an IFS of shear transfor- = 
mations is used to con- 

struct a fractal interpola- 
tion function. (Produced 
by Peter Massopust.) 


It follows that there is effectively one free parameter in each transformation. We 
choose this parameter to be d, for the following reason. The transformation w,, is 
a shear transformation: it maps lines parallel to the y-axis into lines parallel to the 
y-axis. Let L denote a line segment parallel to the y-axis. Then w,,(L) is also a line 
segment parallel to the y-axis. The ratio of the length of w,(L) to the length of L 
is |d,|. We call d, the vertical scaling factor in the transformation w,. By choosing 
d,, to be the free parameter, we are able to specify the vertical scaling produced by 
the transformation. With d, = 0,n = 1,2,..., N, one recovers the piecewise linear 
interpolation function. In section 6.2 we will show that these parameters determine 
the fractal dimension of the attractor of the IFS. 

Let d, be any real number. We demonstrate that we can always solve the above 
equations for a,,, C,, én, and f, in terms of the data and d,,. We find 
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a ce a Xn-1) 
a - 
_ &nXn-1 — X0%n) 
"(xy — Xo) @) 
_ (Fn = Fn-1) _ an(F — Fo) (3) 
"(xy — Xo) (xv — Xo) - 
i= (Xn Fn—-1 — X0Fn) _ : (xy Fo — X0F wn) (4) 
(xn — Xo) (xn — Xo) 


Now let {R?; w,,n =1,2,..., N} denote the IFS defined above. Let the vertical 
scaling factor d, obey 0 < d, < 1 forn =1,2,..., N. Even with this condition, the 
IFS is not in general hyperbolic on the metric space (R?,Euclidean). Despite this, let 
us see what happens if we apply the Random Iteration Algorithm to the IFS. 

Here we present Program 2 in Chapter III modified so that the input data consists 
of the interpolation points and vertical scaling factors. It is written for N = 3 and the 
data set 


{(0, 0), (30, 50), (60, 40), (100, 10)}. 


The vertical scaling factors are input by the user during execution of the code. The 
program calculates the coefficients in the shear transformations from the data, and 
then applies the Random Iteration Algorithm to the resulting IFS. The program is 
written in BASIC. It runs without modification on an IBM PC with enhanced graph- 
ics adaptor and Turbobasic. On any line the words preceded by a ’ are comments: 
they are not part of the program. 


Program 1. 
x(0O] =O : x{i] =30 : x{2] =60 : x[{3] =100 ’ Data set 


FCO] =O : F[1] =50 : F[2] =40 : F[3] =10 

*Vertical Scaling Factors 

input “enter scaling factors d(i), d(2), and d(3)", 
d(1) ,d(2) ,d(3) 

*Calculate the shear transformations from the Data 

?and Vertical Scaling Factors 


for n =1 to 3 


b = x(3])-x(0] : aln) = (x[n)-x[n-1])/b 
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e(n] = (x[(3]*x{n-1]-x[0]*x{n])/b 


c[n] 


(F (n) -F (n-1]-d{n] *(F[3]-F[0]))/b_ 

ff{n] = (x[3])*F(n-1]-x [0] *F [n)-d[n] *(x[3] *F [0] -x [0] *F[3]))/b 
next 

screen 2: cls ’ initalialize graphics 

window(0,0)-(100,100) ’change this to zoom and/or pan 


x =0: y =0 ’initial point from which the random 
7iteration begins 


for n =1 to 1000 ’Random Iteration Algorithm 
k =int (3*rnd-0.0001)+1 
newx = a([k]*x + e[k] 


c{k]*x + d{k]*y + ff [k] 


newy 
xX =newx : y =newy 

pset(x,y) ’plot the most recently computed point on the screen 
next 


end 


The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure VI.164. 
In this case d; = 0.5, d2 = —0.5, and d3 = 0.23. Notice that if the size of the plotting 
window is decreased, for example by replacing the window call by WINDOW (0,0)— 
(50,50), then a portion of the image is plotted at a higher resolution. The number of 
iterations can be increased to impove the quality of the computed image. 


Examples & Exercises 


2.6. Rewrite Program 1 in a form suitable for your own computer environment, then 
run it and obtain hardcopy of the output. 


2.7. Vary the data used by Program 1. Verify, by means of computergraphical exper- 
iments, that the corresponding IFS always seems to have a unique attractor, provided 
that the vertical scaling factors are less than 1 in norm. Verify that, provided suffi- 
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X 


ciently many points are plotted, the attractor always contains the data points, and 
that it looks like the graph of a function. 


2.8. Show that the shear transformations w,, descibed above, need not be contrac- 
tions in the Euclidean metric, even though the magnitudes of the vertical scaling 
factors are less than 1. Once you have found such an example, use Program 1, suit- 
ably modified, to obtain graphical evidence concerning the possible existence of an 
attractor. You are supposed to discover that even though the IFS is not hyperbolic in 
the Euclidean metric, it appears to possess an attractor. 


2.9. Use Program 1 to verify that the attractor of the IFS in exercise 2.4 is a 
parabola. 


We now give the theoretical basis for our experimental observations. 


Theorem 2.1 Let N be a positive integer greater than 1. Let {R*; w,,n = 
1,2,...,N} denote the IFS defined above, associated with the data set 


{(%q, Fx) im =1,2,..., N}. 


Let the vertical scaling factor d, obey 0 < d, < 1 forn=1,2,...,N.Then there is 
a metric d on R?, equivalent to the Euclidean metric, such that the IFS is hyperbolic 
with respect to d. In particular, there is a unique nonempty compact set G C R* such 


that 
G =U"_,w,(G). 


Proof We define a metric d on R* by 
d((x1, yi); (x2, y2)) — [xy St X2| 2 Oly, ang yal, 


Figure VI.164. The re- 
sult of running Program 1 
with vertical scaling fac- 
tors 0.5, —0.5, and 0.23. 
It appears that the corre- 
sponding IFS possesses 
a unique attractor that 

is the graph of a func- 
tion that passes through 
the interpolation points 
{(0, 0), (30, 50), (60, 40), 
(100, 10)}. Is there a met- 
ric such that the IFS is 
hyperbolic? 
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where @ is a positive real number, which we specify below. We leave it as an exercise 
to the reader to prove that this metric is equivalent to the Euclidean metric on R?. 
Let n € {1,2,..., N}. Let the numbers ay, Cn, en, fn, be defined by equations 1, 2, 
3, and 4. Then we have 
A(Wp(X1, V1), Wn(X2, Y2)) 
= d((GnX1 + Cn, CnX1 + dnyi + fn), (AnX2 + €n, CnX2 + dny2 + fn)) 
= An|X1 — X2| + Olen(%1 — x2) + da(yi — Y2)I 
< (lan| + Olenl)|x1 — x21 + Aldnlly1 — yal. 
Now notice that Ja,| = |X, — Xn-1|/|xn — Xo| < 1 because N > 2. If cy =cz:--= 
Cn, = 0, then we choose 6 = 1. Otherwise we choose 
_ min{(2 — Wein ee 1, 2,..., 0} 
-~ temittig.ia=i,2,...,N} 
Then it follows that 
d(wn(x1, Y1), Wn(X2, Y2)) < (lanl + Plenl)|41 — x2| + Aldally — yal 
<a|x; — x2| + 95|y1 — ye! 
< max{a, 5}d((x1, y1), (X2, y2)), 
where 
Wa a ST 2, 


CH Uti rey or To )<1, 


o= manta, 8 ST, Z...-, Ny = I. 
This completes the proof. 

Theorem 2.2 Let N be a positive integer greater than 1. Let {R?; w,z,n = 
1,2,..., N} denote the IFS defined above, associated with the data set {(Xn, Fn): 
n=1,2,..., N}. Let the vertical scaling factor d, obey 0 < d, < 1forn=1,2,..., 
N, so that the IFS is hyperbolic. Let G denote the attractor of the IFS. Then G is 


the graph of a continuous function f : [xo, xn] — R, which interpolates the data 
(x, F;) ‘4.2 1, 2, «.., N). Phatis, 


G = {(x, f(x)): x € [xo, xn]}, 
where 
f(x) = F; fori ='0,5492, 3,505 
Proof Let F denote the set of continuous functions f : [xo, x;] > R such that 
f (Xo) = Fo and f (xn) = Fy. We define a metric d on F by 
d(f, g) =max{| f(x) — g(x)|: x € [xo, xn]} for all f, g inf. 


Then (Ff, d) is acomplete metric space—see, for example [Rudin 1966], or prove it 
yourself. 


2 Fractal Interpolation Functions 


Let the real numbers a,,, Cn, én, fr, be defined by equations 1, 2, 3, and 4. Define 
a mapping T : F — F by 


(T f)(x) = Cal" (x) + da f IZ'(x)) + fa for x € [xn-1, Xn], forn =1,2,...,N, 
where |, : [xo, Xv] — [Xn-1, Xn] is the invertible transformation 
L(x) = AnX + ey. 


We verify that T does indeed take F¥ into itself. Let f € F. Then the function 
(T f)(x) obeys the endpoint conditions because 


(Tf (Xo) = cil (x0) + di f G0) + fn 
= €;X9 + d f (Xo) + fn = 1X0 + 1 Fo + fr 
= Fo 
and 
(Tf )(xw) = cnly' (xn) + dw f ly’ (xw)) + fv 
=Cyxn + dn f (xy) + fn 
= CyXn + dy Fy. 
The reader can prove that (7 f)(x) is continuous on the interval [x,_1, x,] for n = 
1,2,...,.N. Then it remains to demonstrate that (7 f)(x) is continuous at each 
of the points x), x2, x3,...,xXw-1. At each of these points the value of (Tf)(x) is 
apparently defined in two different ways. For n € {1,2,..., N — 1} we have 


(Tf) (Xn) = Sail ee + dna f UT) Xn) + fn+1 
= Cn+1X0 + noi f (Xo) + fri = Fr 
and also 
(Tf) (xn) = Gel te) - dn f IT" (tn) + fn 
= CnXn + d, f (xn) i Sn = F,, 
so both methods of evaluation lead to the same result. We conclude that T does 
indeed take F into F. 
Now we show that T is a contraction mapping on the metric space (F, d). Let 
f,g€F.Letne {l,2,...,N} and x € [x,_1, x,]. Then 
(TF) (x) — (Tg) ()| = dnl lf G7) — 97" (@&))I < Idnld Cf, 8). 
It follows that 
d(T f, Tg) < dd(f,g) where ’=max{|d,|:n=1,2,...,N} <1. 


We conclude that T : # — F is a contraction mapping. The Contraction Mapping 
Theorem implies that T possesses a unique fixed point in F. That is, there exists a 
function f € F such that | 


(Tf) (x) = f() for all x € [xo, xy]. 
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Figure V1.165. A 
sequence of functions 
{frai(x) = (Thr)(x)} 
converging to the fixed 
point of the mapping 

T : F — F used in the 
proof of Theorem 2.2. 
This is another example 
of a contraction mapping 
doing its work. 


The reader should convince himself that f passes through the interpolation points. 
Let G denote the graph of f. Notice that the equations that define T can be 
rewritten 


(T f )(GnX + €n) = Cnx + nf (x) + fn for x € [xo, xv], forn =1,2,...,N, 
which implies that 
G= UN_,wn(G). 


But G is a nonempty compact subset of R?. By Theorem 2.1 there is only one 
nonempty compact set G, the attractor of the IFS, which obeys the latter equation. It 
follows that G = G. This completes the proof. 


Definition 2.2 The function f(x) whose graph is the attractor of an IFS as 
described in Theorems 2.1 and 2.2, above, is called a fractal interpolation function 
corresponding to the data {(x;, F;):i =1,2,..., N}. 


Figure VI.165 shows an example of a sequence of iterates {7° fo: n =0, 1, 2, 3, 
...} obtained by repeated application of the contraction mapping T, introduced in 
the proof of Theorem 2.2. The initial function fo(x) is linear. The sequence con- 
verges to the fractal interpolation function f, which is the fixed point of T. Notice 
that the whole image can be interpreted as the attractor of an IFS with condensation, 
where the condensation set is the graph of the function fo(x). 

The reader may wonder, in view of the proof of Theorem 2.2, why we go to the 
trouble of establishing that there is a metric such that the IFS is contractive. After all, 
we could simply use T to construct fractal interpolation functions. The answer has 
two parts, (a) and (b). (a) We can now apply the theory of hyperbolic IFS to fractal 
interpolation functions. Of special importance, this means that we can use IFS algo- 
rithms to compute fractal interpolation functions, that the Collage Theorem can be 
used as an aid to finding fractal interpolation functions that approximate given data, 


2 Fractal interpolation Functions 


and that we can use the Hausdorff metric to discuss the accuracy of approximation of 
experimental data by a fractal interpolation function. (b) By treating fractal interpo- 
lation functions as attractors of IFS of affine transformations we provide a common 
language for the description of an important class of functions and sets: the same 
type of formula, namely an IFS code, can be used in all cases. 

One consequence of the fact that the IFS {R; w,,n = 1,2,...,N} associated 
with a set of data {(x,, F,):n =1,2,...,.N} is hyperbolic is that any set Ao € 
7H(R?) leads to a Cauchy sequence of sets {A,,} that converges to G in the Hausdorff 
metric. In the usual way we define W : H(R2) > 1(R2) by 


W(B) =U*_,w,(B) forall B € H(R?). 


Then {A, = W°"(Ao)} is a Cauchy sequence of sets which converges to G in the 
Hausdorff metric. This idea is illustrated in Figure VI.166. Notice that if Ao is the 
graph of a function fo € F then A, is the graph of T°” fo. 


Examples & Exercises 


2.10. Prove that the metric on R? introduced in the proof of Theorem 2.1 is equiva- 
lent to the Euclidean metric on R?. 


2.11. Use the Collage Theorem to help you find a fractal interpolation function that 
approximates the function whose graph is shown in Figure VI.167. 


2.12. Write a program that allows you to use the Deterministic Algorithm to com- 
pute fractal interpolation functions. 


2.13. Explain why Theorems 2.1 and 2.2 have the restriction that N is greater 
than 1. 

2.14. Let a set of data {(x;, F;):i =0, 1,2,..., N} be given. Let the metric space 
(F, d) and the transformation T : F — F be defined as in the proof of Theorem 2.2. 
Prove that if f € F then 7 is an interpolation function associated with the data. 
Deduce that if f ¢ F is a fixed point of JT then f is an interpolation function 
associated with the data. 


2.15. Make a nonlinear generalization of the theory of fractal interpolation func- 
tions. For example, consider what happens if one uses an IFS made up of nonlinear 
transformations w, : R* > R? of the form 


Wn (X,Y) = (Gnx + €n, CnX + ny + Bny? + fr); 


where a, Cn, dn, Zn, and f, are real constants. This example uses “quadratic scaling” 
in the vertical direction instead of linear scaling. Determine sufficient conditions for 
the IFS to be hyperbolic, with an attractor that is the graph of a function that inter- 
polates the data {(x;, F;):i =0, 1, 2,..., N}. Note that in certain circumstances the 
IFS generates the graph of a differentiable interpolation function. 


2.16. Let f(x) denote a fractal interpolation function associated with a set of data 
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Figure VI.166. Exam- 
ples of the convergence 
of a sequence of sets {A,} 
in the Hausdorff metric, 
to the graph of a fractal 
interpolation function. 


2 Fractal Interpolation Functions 


Figure VI.167. Use 
the Collage Theorem to 
find an IFS {R?; Wi, Ww}, 
where w; and wy» are 
shear transformations 
on R?, such that the 
attractor of the IFS is a 
good approximation to 
the graph of the function 
shown here. 


{(x;, F;):i =0,1,2,..., N}, where N > 1. Let the metric space (F,d) and the 
transformation T : F — F be defined as in the proof of Theorem 2.2. The functional 
equation Tf = f can be used to evaluate various integrals of f. As an example 
consider the problem of evaluating the integral 


I =| f(x) dx. 


The integral is well defined because f(x) is continuous. We have 


XN N Xn 
t= [ ap@dr=> [peas 


0 n=] “*n-1 
N XN 
25 (CnX + dyn f(x) + fr)d(Anx + én) =al + B, 
n=] ~ X0 
where 


N N XN 
a= (Sandy) and B=) ray | (eax + fadde. 
n=1 x0 


n=] 


Show that, under the standard assumptions, || < 1. Show also that 


fs / be 
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Figure VI.168. _Illus- eearerermeesovinen 
tration of the geometrical 
viewpoint concerning the 


integration of fractal inter- (total area) =( Det(A,)+Det (A.))«(total area) + B | 
polation functions. 


i Re 


area = Det(A,) « (total area) area = Det(A,)x (total area) 
B=area of triangle 


where fo(x) is the piecewise linear interpolation function associated with the data. 
Conclude that 


. Ea ig 
[ fa = as 


Check this result for the case of the parabola, described in Exercise 2.4. In Fig- 
ure VI.168 we illustrate a geometrical way of thinking about the integration of a 
fractal interpolation function. 


2.17. Let f(x) denote a fractal interpolation function associated with a set of data 
{(x;, F;):i =0,1,2,...,N}, where N > 1. By following similar steps to those in 
exercise 2.16, find a formula for the integral 


lez [xteax. 


Check your formula by applying it to the parabola described in exercise 2.4. 


2.18. Figure VI.169 shows a fractal interpolation function together with a zoom. 
Can you reproduce these images and then make a further zoom? What do you expect 
a very high-magnification zoom to look like? 
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Figure V1.169. A frac- 
tal interpolation function 
together with a zoom. If 
the fractal dimension is 
equal to 1, what do you 
expect “most” very high 
magnification zooms to 
look like? 


< 


3 The Fractal Dimension of Fractal Interpolation Functions 


The following excellent theorem tells us the fractal dimension of fractal interpolation 
functions. 


Theorem 3.1 Let N be a positive integer greater than 1. Let {(Xn, Fn) € R’: 
n=1,2,...,N)} be a set of data. Let {R?; w,,n =1,2,...,.N} be an IFS associ- 
ated with the data, where 


x a, O x en ae 
vn(*)=(% 2)G)+(4) torn = 1. 2, ..3 Is 


The vertical scaling factors d, obey 0 < d, < 1; and the constants ay, Cn, en, and fy, 
are given by equations 1, 2,3, and 4, forn=1,2,..., N. Let G denote the attractor 
of the IFS, so that G is the graph of a fractal interpolation function associated with 
the data. If 


N 
Yo lanl > 1 (1) 
n=1 


and the interpolation points do not all lie on a single straight line, then the fractal 
dimension of G is the unique real solution D of 
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Figure V1.170. The 
graph G of a fractal 
interpolation function is 
superimposed on a grid 
of closed square boxes 
of side length «. N(e) is 
used to denote the number 
of boxes that intersect 

G. What is the value of 
Ne)? 


= 

eet eee te 

SS Sse< SSF kee 26 =Seaeeeka 
cae? . 2S hel Ssesek Se easees 
Pe Na 
aos £2285 2s SSaSe Seva SEARS S 
ce £2SE8 5 SNR See Se eeNakee 
feof 


ae aee. 
SLaeCeeiias= x 
Beat BASSAS SARS See 21S Be Cee 


2 de Se 


n=! 


Otherwise the fractal dimension of G is I. 


Proof (Informal Demonstration). The formal proof of this theorem can be 
found in [Barnsley 1988f]. Here we give an informal argument for why it is true. We 
use the notation in the statement of the theorem. 

Let € > 0. We consider G to be superimposed on a grid of closed square boxes of 
side length ¢€, as illustrated in Figure VI.170. Let N (e€) denote the number of square 
boxes of side length € which intersect G. These boxes are similar to the ones used 
in the Box Counting Theorem, Theorem 1.2 in Chapter V, except that their sizes are 
arbitrary. On the basis of the intuitive idea introduced in Chapter V, section 1, we 
suppose that G has fractal dimension D, where 


N(e) © constant - «7? ase > 0. 


We want to estimate the value of D on the basis of this assumption. 

Let n € {1,2,..., N}. Let N,,(€) denote the number of boxes of side length «€ 
which intersect w,(G) for n = 1,2,..., N. We suppose that € is very small com- 
pared to |xy — xo|. Then because the IFS is just-touching it is reasonable to make 
the approximation 


Ne) © Nie) + Noe) + N3(€) + +++ + Nn). (2) 
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Figure VI.171. The 
boxes that intersect G can 
be thought of as organized 
in columns. The set of 
columns of boxes of side 
length € which intersect G 
is denoted by {c;(€): j = 
., K(e)}, where 
K(e) denotes the number 
of columns. What is the 
value of K(e) and how 
many boxes are there in 
C2(€), in this illustration? 


We now look for a relationship between VV (€) and N,,(€). The boxes that intersect G 
can be thought of as being organized into columns, as illustrated in Figure VI.171. 

Let the set of columns of boxes of side length € which intersect G be denoted 
by {cj(€): j =1,2,..., K(€)}, where K(€) denotes the number of columns. Under 
the conditions in equation 1, in the statement of the theorem, one can prove that the 
minimum number of boxes in a column increases without limit as € approaches zero. 
To simplify the discussion we assume that 


id, > @, for nn = 1,2, 1, 


(Notice that 


(Xn — Xn-1) 
ya =p Sneed =, 
= (xn — Xo) 
which tells us that this assumption is stronger than the assumption Bes a \d,| > 1.) 
Then consider what happens to a column ofboxes cj;(€) of side length « when we 
apply the affine transformation w,, to it. It becomes a column of parallelograms. The 
width of the column is a,¢€ and the height of the column is |d,| times the height of 
the column before transformation. Let NV’ (c;(€)) denote the number of boxes in the 
column c;(€). Then the column w,(cj;(€)) can be thought of as being made up of 
square boxes of side length a,¢, each of which intersects w,(G). How many boxes 
of side length a,¢ are there in this column? Approximately |d,|N(c(€))/an. Adding 
up the contribution to N,(an€) from each column we obtain 


Ke) d, dn K(e) d, 
Na(ae) =) ANEHED =| a Mert y=“ IN) 
j=l an ay 


The situation is illustrated in Figure VI.172. 
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Figure VI.172. When 
the shear transforma- 
tion w, is applied to the 
columns of boxes which 
cover the graph, G, the 
result is a set of thinner 
columns, of width ajé, 
which cover w;(G). The 
new columns are made up 
of small parallelograms, 
but the number of square 
boxes of side length a,é 
which they contain is read- 
ily estimated. 


From the last equation we deduce that when € is very small compared to [xo, xy], 


Nutone )forn =1,2,...,N. (3) 


We now substitute from = 3 into 2 to obtain the functional equation 
Nox SNe ae 2 < Jaa oN (= at +5 Sele 


Into this equation we substitute our assumption VV - = constant — to obtain the 
equation 


= D-1_-D 1 = saz 
6 we |d,|ayP~'e? + |dglay?'e~? + |d3lag? 1c? + --- + |dylayPe”. 


The main formula in the statement of the theorem follows at once. 

If the interpolation points are collinear, then the attractor of the IFS is the line 
segment that connects the point (xo, Fo) to the point (xy, Fy), and this has fractal 
dimension 1. If ae \d,,| < 1, then one can show that NV (e) behaves like a constant 
times €~!, whence the fractal dimension is 1. This completes our informal. demon- 
stration of the theorem. 


Examples & Exercises 


3.1. We consider the fractal dimension of a fractal interpolation function in the 
case where the interpolation points are equally spaced. Let x; = x9 + + (xn — Xo) 
for i =0,1,2,...,.N. Then it follows that a, = + forn=1,2,..., N. Hence if 
condition (1) in Theorem 3.1 holds then the fractal dimension D of the interpolation 
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function obeys 


x L py Ep a 
) |dnl 3) = (x) Do lanl = 1. 
n=1 n=1 

It follows that 


log (Sos Ian!) 
ee 


This is a delightful formula for reasons of two types, (a) and (b). (a) This formula 
confirms our understanding of the fractal dimension of fractal interpolation func- 
tions. For example, notice that )~”_, |d,| < N. Hence the dimension of a fractal 
interpolation function is less than 2: however, we can make it arbitrarily close to 2. 
Also, under the assumption that > Sek |d,| > 1, the fractal dimension is greater than 
1: however, we can vary it smoothly down to 1. (b) It is remarkable that the fractal 
dimension does not depend on the values {F; :i =0,1,2,..., N}, aside from the 
constraint that the interpolation points be noncollinear. Hence it is easy to explore 
a collection of fractal interpolation functions, all of which have the same fractal di- 
mension, by imposing the following simple constraint on the vertical scaling factors: 


N 


> lend = NP. 


n=1 


Figure VI.173 illustrates some members of the family of fractal interpolation 
functions corresponding to the set of data {(0, 0), (1, 1), (2, 1), (3, 2)}, such that the 
fractal dimension ofeach member of the family is D = 1.3. 

Figures 173 (a) and (c) illustrate members of a family of fractal interpolation 
functions parameterized by the fractal dimension D. Each function interpolates the 
same set of data. 


3.2. Make an experimental estimate of the fractal dimension of the graphical data 
in Figure VI.175. Find a fractal interpolation function associated with the data 
{(0, 0), (50, 50), (100, 0)}, which has the same fractal dimension and two equal ver- 
tical scaling factors. Compare the graph of the fractal interpolation function with the 
graphical data. 


3.3. Find a fractal interpolation function that approximates the experimental data 
shown in Figure V.147. 


3.4. Figure VI.176 shows the graphs of functions belonging to various one-para- 
meter families of fractal interpolation functions. Each graph is the attractor of an 
IFS consisting of two affine transformations. Find the IFS associated with one of the 
families. 
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Figure V1.173. Mem- 
bers of the family of frac- 
tal interpolation functions 
corresponding to the set of 
data {(0, 0), (1, 1), (2, 1), 
(3, 2)}, such that the frac- 
tal dimension of each Je 
j 
(a) = 


member of the family is 
D=13. 


RRA 


(b) 


4 Hidden Variable Fractal interpolation 229 


(c) 


4 Hidden Variable Fractal Interpolation 


We begin by generalizing the results of section 6.2. Throughout this section, let 
(Y, dy) denote a complete metric space. 


Definition 4.1 Let 1 C R. Let f : 1 — Y be a function. The graph of f is the set 
of points 


G={(x,f(x)EeRxY:xeET}. 

Definition 4.2 A set of generalized data is a set of points of the form {(x;, F;) € 

Rx Y:i=0,1,2,...,N}, where 
Xo < X%1 < XQ <XAZ< °° KN. 
An interpolation function corresponding to this set of data is a continuous function 
f :[x0, xn] — Y such that 
fad=F; fori = 1, 2... NN. 

The points (x;, F;) € R x Y are called the interpolation points. We say that the func- 


tion f interpolates the data and that (the graph of) f passes through the interpola- 
tion points. 


Let X denote the Cartesian product space R x Y. Let 0 denote a positive number. 
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Figure VI.174. Mem- 
bers of a one-parameter 
family of fractal interpo- 
lation functions. They 
correspond to the set 

of data {(0, 0), (1, 1), 

(2, 1), G, 2)} with ver- 
tical scaling factors 

d, = —d,=d3= 
for DP = 1, 1.1, 12, 

1.3, 1.4, 1.5, 1.6, and4.7. 
D is the fractal dimension 
of the fractal interpolation 
function. 
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Figure VI.176. This figure shows the graphs of various one-parameter families of fractal 
interpolation functions. Each graph is the attractor of an IFS consisting of two affine 


transformations. Can you find the families? 
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Define a metric d on X by 

d(X 1, X2) = |x; — x2| + Ody(y1, y2), (1) 
for all points X; = (x1, y,) and X2 = (x2, y2) in X. then (X, d) is a complete metric 
space. 


Let N be an integer greater than 1. Let a set of generalized data {(x;, F;) « X:i= 
0, 1,2,..., N} be given. Let n € {1,2,..., N}. Define L, : R > R by 


L,(x) = a,x + e, where a, = Vere and e, = Gn%n—1 — 0%n) (2) 
(xn — Xo) (xn — Xo) 


so that L,,([xo, xn J) = [Xn-1, Xn]. Let c and s be real numbers, with 0 < s < 1 and 
c > 0. For eachn € {1,2,..., N} let M, : X — Y bea function that obeys 


d(M,(a, y), M,(b, y)) < cla — b| for alla, be R, (3) 
and 

d(M,,(x, a), Mn(x, b)) < sdy(a, b) for alla, be Y. (4) 
Define a transformation w, : X — X by 


Wn (x, y) = (Ln(x), Mn(x, y)) for all (x, y) €X,n =1,2,...,N. 


Theorem 4.1] Let the IFS {X; wz,n =1,2,...,N} be defined as above with 
N > 1. In particular, assume that there are real constants c and s such that 0 <s < 
1, 0 <c, and conditions 3 and 4 are obeyed. Let the constant 0 in the definition of 
the metric d in equation I be defined by 


a 


where a = max{qa; :i = 1,2,..., N}. 
Then the IFS {X; wn, n = 1, 2,..., N} is hyperbolic with respect to the metric d. 


Proof This follows very similar lines to the proof of Theorem 2.1. We leave it as 
an exercise for enthusiastic readers. The proof can also be found in [Barnsley 1986]. 

We now constrain the hyperbolic IFS {X; w,,n = 1,2,..., N}, defined above, to 
ensure that its attractor includes the set of generalized data. We assume that 


M,(xo, Fo) = Fn-: and M, (xn, Fy) = F, forn = 1,2,...,N. (5) 
Then it follows that 
Wn (Xo, Fo) = (Xn-1, Fn—1) and wa(xy, Fv) = (Xn, F,) forn = 1,2,..., N. 
Theorem 4.2 Let N be a positive integer greater than 1. Let {X; w,,n= 
1,2,..., N} denote the IFS defined above, associated with the generalized data set 


{(x, Fi) € R x Y:i =1,2,..., N}. In particular, assume that there are real con- 
stants c and s such that 0 < s < 1,0 <c, and conditions 3, 4, and 5 are obeyed. Let 
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G € H(X) denote the attractor of the IFS. Then G is the graph of a continuous func- 
tion f : [xo,xn] — Y which interpolates the data {(x;, Fj):i =1,2,..., N}. That 
1S, 


G={(x, f()): x € [x, xn J}, 
where 


f(x) = F; fori aU, ft. 2, :-.. iN: 


Proof Again we refer to [Barnsley 1988f]. The proof is analgous to the proof of 
Theorem 2.2. 


Definition 4.3 The function whose graph is the attractor of an IFS, as described 
in Theorems 4.1 and 4.2, above, is called a generalized fractal interpolation function, 
corresponding to the generalized data {(x;, F;):i =1,2,..., N}. 


We now show how to use the idea of generalized fractal interpolation functions 
to produce interpolation functions that are more flexible than heretofore. The idea is 
to construct a generalized fractal interpolation function, using affine transformations 
acting on R?, and to project its graph into R*. This can be done in such a way that 
the projection is the graph of a function that interpolates a set of data {(x;, Fj) € 
R2:i=1,2,...,N}. The extra degrees of freedom provided by working in R° 
give us “hidden” variables. These variables can be used to adjust the shape and 
fractal dimension of the interpolation functions. The benefits of working with affine 
transformations are kept. 

Let N be an integer greater than 1. Let a set of data {(x;, F;) € R2:i =0, 1,2, 
..., N} be given. Introduce a set of real parameters {Hj :i =0,1,2,..., N}. For 
the moment let us suppose that these parameters are fixed. Then we define a gen- 
eralized set of data to be {(x;, F;, H;) € R x R?:i =0,1,2,..., N}. In the present 
application of Theorem 4.2 we take (Y, dy) to be ( IR*, Euclidean). We consider an 
IFS {R3?; wz, =1,2,...,N}, where for n € {1,2,..., N} the map w, : R® > R?° 
is an affine transformation of the special structure: 


x 2 2 x en 
Wn} Y}= | Cn dn In ye Sn 
Z kn [ly Mp Z 8n 


Here Gn, Cns Ans Ens Snr &n» Mins Kn» In, and m,, are real numbers. We assume that they 
obey the constraints 


X0 Xn-1 
Wn | Fo | =Wn| Fa-1 |; 
Go Gre} 


and 


233 


234 Chapter Vi Fractal Interpolation 


XN Xn 
Ue ft Py be ww, | Fy |} forn = 1,2.6.. 2N. 
Gn a. 


Then we can write 
Wn(X, Y, Z) = (Ln(x), Mn(x, y, 2)) for all (x, y,z) € R°,n =1,2,...,N, 
where L,,(x) is defined in Equation 2 and M,, : R*? > R? is defined by 


x 
a y Sn + Cnx 
: =4.[?]+(2 ee], 


where 
fle wT es lh ceal (6) 
Le Wi}, 


Let us replace F, in condition 5 by (F,, H,). Then M,, obeys condition 5. Let us 
define 


c = max{max{c;, k;}:i=1,2,..., N}. 


Then condition 3 is true. Lastly, assume that the linear transformation A, : R? > R2 
is contractive with contractivity factor s with 0 < s < 1. Then condition 4 is true. 
We conclude that, under the conditions given in this paragraph, the IFS {R?; w,, = 
1,2,..., N} satisfies the conditions of Theorem 4.2. It follows that the attractor of 
the IFS is the graph of a continuous function f : [xo, xv] > R2 such that 


f@i) = (Fj, H) fori =1,2,..., N. 
Now write 
f(x) = (fi), fo(x)). 
Then f; : [xo, xv] — R is a continuous function such that 


fi@i) = F; fori =1,2,..., N. 


Definition 4.4 The function f, : [xo, xn] —> R* constructed in the previous 
paragraph is called a hidden variable fractal interpolation function, associated with 
the set of data {(x;, F;) € R?:i =1,2,..., N}. 


The easiest method for computing the graph of a hidden variable fractal interpo- 
lation function is with the aid of the Random Iteration Algorithm. Here we present 
an adaptation of Program 1. It computes points on the graph of a hidden variable 
fractal interpolation function and displays them on a graphics monitor. It is written 
for N = 3 and the data set 


{(0, 0), (30, 50), (60, 40), (100, 10)}. 
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The “hidden” variables, namely the entries of the matrices A, and the number H,, 
for n = 1, 2,3, are input by the user during execution of the code. The program 
calculates the coefficients in the three-dimensional affine transformations from the 
data, and then applies the Random Iteration Algorithm to the resulting IFS. The first 
two coordinates of each successively computed point, which has three coordinates, 
are plotted on the screen of the graphics monitor. The program is written in BASIC. 
It runs without modification on an IBM PC with Enhanced Graphics Adaptor and 
Turbobasic. On any line the words preceded by a ’ are comments: they are not part 
of the program. 


Program 2. 


x{0] =O : x[{1] =30 : x[2] =60 : x(3] =100 ’Data set 


FCO] =O : F({i] =50 : F[2] =40 : F[3] =10 


input "enter the hidden variables H[0], H[i], H[2] and H[3]", 
H(i] ,H(2]) ,H(3] ,H([4] ’Hidden Variables 


for n = 1 to 3°: print “for n= *, 28 


input "enter the hidden variables d, h, l, m", 
d{n],hh[n],1[n],m[n] ’More Hidden Variables 


next 


for n = 1to3 ’Calculate the affine transformations from 
the Data and the Hidden Variables 


F [n-1]-d[n]+*F (0]-hh[n]*H(0] : q = H{n-1]-1{n)*F(0]-m[n] *H[0] 


a 
it 


F (n] -d(n)] *F [3] -hh[n]*H[3] : s = H[{n)-1[n] *F(3)-m[n] *H[3] 


| 
I] 


b 


x(3)-x[0] : c{n] = (r-p)/b : kin] = (s-q)/b 


a(n] = (x(n]-x[m-1])/b : efm) = (x[3]*x[n-1]-x(0]*x[n])/b 


ff[n] = p-c[n]*x(0] : g{n] = q-k{n)*x[0] 
next 
screen 2: cls ’initalialize graphics 


window(0,0)-(100,100) ’change this to zoom and/or pan 
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x =0: y =0: z =hh(0) ’initial point from which the random 
iteration begins 
for n = 1 to 1000 ’Random Iteration Algorithm 
kk = int (3*rnd-0.0001)+1 


a(kk]*x + e[kk] 


‘ 
a 


newy c(kk]*x + d[kk]*y + hh[kk]*z + ff [kk] 


newz = k({kk]*x + 1{kk]*y + m[{kk]*z + g[kk] 
x = newx : y = newy: z = newz 


pset(x,y),z ’plot the most recently computed point, in 
color z, on the screen 


next 


end 


The result of running an adaptation of this program on a Masscomp workstation 
and then printing the contents of the graphics screen is presented in Figure VI.177. In 
this case H[0] = 0, H[1] = 30, H[2] = 60, H[3] = 100, d(1) = d(2) = d(3) = 0.3, 
h(1) = h(2) = 0.2, h(3) = 0.1, 101) = 1(2) =1(3) = —0.1, m(1) = 0.3, m(2) =0, 
m(3) = —0.1. Remember that the linear transformation A, must be contractive, so 
certainly do not enter values of magnitude larger than 1 for any of the numbers 
d(n), h(n), l(n), and m(n). The program renders each point in a color that depends 
on its z-coordinate. This helps the user to visualize the “hidden” three-dimensional 
character of the curve. 

The important point about hidden variable fractal interpolation is this. Although 
the attractor of the IFS is a union of affine transformations applied to the attractor, 
this is not the case in general when we replace the word “attractor” by the phrase 
“projection of the attractor.” The graph of the hidden variable fractal interpolation 
function f;(x) is not self-similar, or self-affine, or self-anything! 

The idea of hidden variable fractal interpolation functions can be developed using 
any number of “hidden” dimensions. As the number of dimensions is increased, the 
process of specifying the function becomes more and more onerous, and the function 
itself, seen by us in flatland, becomes more and more random. One would never 
guess, from looking at pictures of them, that they are generated by deterministic 
fractal geometry. 
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Examples & Exercises 
4.1. Generalize the proof of Theorem 2.1 to obtain a proof of Theorem 4.1. 
4.2. Let F denote the set of continuous functions f : [xo, xv] — Y such that 
f (xo) = Fo and f (xn) = Fy. 
Define a metric d on F by 
d(f, g) =max{dy(f (x), g(x)) : x € [xo, xn]}- 


Then (F, d) is a complete metric space; see, for example [Rudin 1966]. Use this fact 
to help you generalize the proof of Theorem 2.2 to provide a proof of Theorem 4.2. 


4.3. Rewrite Program 2 in a form suitable for your own computer environment, then 
run it and obtain hardcopy of the output. 


4.4. Modify your version of Program 2 so that you can adjust one of the “hidden” 
variables while it is running. In this way, make a picture that shows a one-parameter 
family of hidden variable fractal interpolation functions. 


4.5. Modify your version of Program 2 so that you can see the projection of the 
attractor of the IFS into the (y, z) plane. To do this simply plot (y, z) in place of 
(x, y). Make hardcopy of the output. 


4.6. Figure VI.178 shows three projections of the graph G of a generalized fractal 
interpolation function f : [0, 1] > R*. The projections are (i) into the (x, y) plane, 
(ii) into the (x, z) plane, and (iii) into the (y, z) plane. G is the attractor of an IFS of 
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Figure V1.177. An ex- 
ample of a hidden vari- 
able fractal interpola- 
tion function. This graph 
was computed using Pro- 
gram 2 with the follow- 
ing “hidden” variables: 
A[(0] =0, A[1] = 30, 
H[2] = 60, H[3] = 100, 
d(1) = d(2) = d(3) = 
0.3, h(1) = h(2) = 0.2, 
A{3) = 0.1, 11) =i) = 
1(3) = —0.1, m(1) = 0.3, 
m(2) = 0, m(3) = —0.1. 
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the form {R?; w;, w2}, where w and w2 are affine transformations. Find w, and wp. 
See also Figure VI.179. 


4.7. Use a hidden-variable fractal interpolation function to fit the experimental data 
in Figure V.147. Here is one way to proceed. (a) Modify your version of Program 
6.4.1 so that you can adjust the “hidden” variables from the keyboard. (b) Trace 
the data in the Figure V.147 onto a sheet of flexible transparent material, such as a 
viewgraph. (c) Attach the tracing to the screen of your graphics monitor using clear 
Sticky tape. (d) Interactively adjust the “hidden” variables to provide a good visual 
fit to the data. 


4.8. «Show that, with hidden variables, one can use affine transformations to con- 
struct graphs of polynomials of any degree. 


5 Space-Filling Curves 


Here we make a delightful application of Theorem 4.2. Let A denote a nonempty 
pathwise-connected compact subset of R*. We show how to to construct a continu- 
ous function f :[0, 1] > R?* such that f([0, 1]) = A. 

Let (Y, dy) denote the metric space (R*,Euclidean). We represent points in Y us- 
ing a Cartesian coordinate system defined by a y-axis and a z-axis. Thus, (y, Z) may 
represent a point in Y. To motivate the development we take A =m C Y. Consider 
the just-touching IFS {Y; w,, w2, w3, w4}, where the maps are similitudes of scaling 
factor 0.5, corresponding to the collage in Figure VI.180. 

Let 

(Fo, Ho) = (0, 0), (F;, A) = (0, 0.5), (F2, H2) = (0.5, 0.5), 
(F3, 3) co (1, 0.5), and 
(F4, Hg) = (1, 0). 
The maps are chosen so that 
Wn(Fo, Ho) = (Fn-1, An-1) and w,(F4, H4) = (F,, H,) for n = 1, 2, 3, 4. 


The IFS code for this IFS is given in Table VI.1. 

Let Ao € 7{(m) denote a simple curve that connects the point (Fo, Ho) to the point 
(F4, Hq), such that Ag N Om = {(Fo, Ho), (F4, H4)}. This last condition says that the 
curve lies in the interior of the unit square box, except for the two endpoints of the 
curve. Consider the sequence of sets {A, = W™" (Ao) } 25 where W : H(@) > H(m) 
is defined by 


W(B) = U%_,w,(B) for all B € H(m). 


It follows from Theorem 7.1 in Chapter III that the sequence converges to m in 
the Hausdorff metric. The reader should verify that, for eachn = 1,2,...,A, isa 
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Figure VI.178. This 
figure shows three pro- 
jections of the graph 
of a generalized frac- 
tal interpolation function 
f : [0,1] > R?. The pro- 
jections are into the (x, y) 
plane, the (x, z) plane, and 
the (y, z) plane. G is the 
TY 12 attractor of an IFS of the 
x x form {R?; w,, w2}, where 
w, and wz are affine trans- 
formations. Can you find 
w, and w? 


TY 
2 


Figure VI.179. Three 
orthogonal projections of 
the graph of a general- 
ized fractal interpolation 
function. The fractal di- 
mension here is higher 
than for Figure VI.178. 
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Figure VI.180. Col- 

lage of M@ using four simil- 

itudes of scaling factor 

0.5. The map w,, is cho- 

sen so that w,(Fo, Ho) = 

(F,-1 H,-1) and 

wWr( Fa, Hy) = (Fi, A) for 

n=1,2,3,4. (F3, H3) 


(Fo, Ho) (F4, Hq) 


Table Vi.1. IFS code form, constrained to yield a space-filling curve. 


w a b c d ée 2 P 


ee OS. -05. 0 0 0 0.25 
y ae |. eee 0 05 0 05. OD) 
3 -05.—0 0 0.5---0.5--05..0,25 
4 0 05 -05 0O 1 0.5 0.25 


simple curve that connects the points (Fo, Ho) to the point (F4, H4). Sequences of 
such curves are illustrated in Figures VI.181—VI.184. 

We use the IFS defined in the previous paragraph to construct a continuous func- 
tion f : (0, 1] > msuch that f([0, 1]) =m. We achieve this by exploiting a hidden 
variable fractal function constructed in a special way. We use ideas presented in 
Chapter VI, section 4. Consider the IFS {R°; w,,n =1,2,..., N}, where the map 
w,, : R? — R? is the affine transformation 


x 025 0° @ x (n — 1)/4 
wiv imt CO 2 eh. y{t+ Cn for n € {1, 2, 3, 4}. 
z 0 Cn an Zz tn 


The constants a,, Dn, Cn, dn, €n, and f, are defined in Table VI.1. This IFS satisfies 
Theorem 4.2, corresponding to the set of data 


{(0, Fo, Ho), (0.25, Fi, Hi), (0.5, Fo, Hz), (0.75, F3, Hs), (1, Fa, H4)}. 


It follows that the attractor of the IFS is the graph, G, of a continuous function 
f :[0, 1] + R*. What is the range of this function? It is 
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Figure VI.183. A se- 
quence of sets “converging 
to” al. These are obtained 
by application of the De- 
terministic Algorithm to 
the IFS code in Table VI.1, 
starting from the set Ao in 
the lower left panel. How 
fascinating they are! 


Figure VI.184. A se- 
quence of curves “‘con- 
verging to” a space-filling 
curve. These are obtained 
by application of the De- 
terministic Algorithm to 
the IFS code in Table VI.1, 
starting from a curve Ao, 
which connects (0, 0) to 
(1, 0) and lies in @. 


Pet ol eee 
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Gy, = {(y,z) € R?: (x, y, z) € G}, 


namely the projection of G into the (y, z) plane. It is straightforward to prove that 
Gy, is the attractor A = m of the IFS defined by the IFS code in Table VI.1. It follows 
that f([0, 1]) =m. So we have our space-filling curve! 

We have something else very exciting as well. The attractor of the three-dimen- 
sional IFS is the graph of a function from [0, 1] tom. The projections G,y and G,,, in 
the obvious notation, are graphs of hidden-variable fractal functions, while Gy, = =. 
What does G look like from other points of view? Various views of the attractor are 
illustrated in Figures VI.185 and VI.186. We conclude that G is a curious, complex, 
three-dimensional object. It would be wonderful to have a three-dimensional model 
of G made out of very thin strong wire. 

The following theorem summarizes what we have just learned. 


Theorem 5.1 Let A C R? be anonempty pathwise-connected compact set, such 
that the following conditions hold. Let N be an integer greater than 1. Let there be 
a hyperbolic IFS {R?; M,,n =1,2,..., N} such that A is the attractor of the IFS. 
Let there be a set of distinct points {(F;,G;) € A:i =0,1,2;,..., N} such that 


M, (Fo, Ho) = (Fn-1, Hn-1) and wWn( Fw, Hy) = (Fr, Ai,) forn = ee ee ,N 


Then there is a continuous function f : [0,1] —> R? such that f ({0, 1]) = A. One 
such function is the one whose graph is the attractor of the IFS 


1 n—1l 
{R?; w,(x,y, z) = (x + ——, Ma(y, 2). n= 1,2,..., N}. 


N 


Examples & Exercises 


5.1. Let A denote the Sierpinski triangle with vertices at the points (0, 0), (0, 1), 
and (1, 0). Find an IFS of the form {R*; w;, w2, w3}, where the maps are affine, such 
that the attractor of the IFS is the graph of a continous function f : [0, 1] —~ R? such 
that f([0, 1]) = A. Four projections of such an attractor are shown in Figure VI.187. 


5.2. Find an IFS {R?; w;, w2, w3, ws}, where the transformations are affine, whose 
attractor is the graph of a continuous function f : [0, 1] > R? such that f aU, 1) = 
A, where A is the set represented in Figure VI. 188. 
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Figure VI.185. Vari- 
ous views of the attractor 
of a certain IFS. From 
some points of view we 
see that it is the graph of a 
function. From one point 
of view it is the graph of a 
space-filling curve! 


Figure VI.186. Higher- 
resolution view of the 
lower right panel of Fig- 
ure VI.185. 
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Figure VI.188. Find an 
IFS {R°*; wy, wo, ws, ws}, 
where the transforma- 
tions are affine, whose 
attractor is the graph of 

a continuous function 

f :[0, 1] > R? such that 
Ff ((0O, 1]) = A, where A is 
the set represented here. 
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Figure VI.187. Four 
views of the attractor of 
an IFS. This attractor is 
the graph of a continuous 
function f : [0, 1] > R? 
such that f({0, 1]) is 

a Sierpinski triangle. 
This function provides 

a “space-filling” curve, 
where space is a fractal! 


Chapter VII 
Julia Sets 


1 The Escape Time Algorithm for Computing Pictures of IFS Attractors 
and Julia Sets 


Let us consider the dynamical system {R*; f}, where f : R? — R? is defined by 
(2x,2y—1) if y>0.5, 
f(x,y)= 4 (Qx-1,y) ifx>0O5 andy <0.5, 
(2x, 2y) otherwise. 


This dynamical system is related to the IFS {R?; w;, w2, w3}, where 


w(x, y) = (0.5x, 0.Sy + 0.5), 

w(x, y) = (0.5x + 0.5, 0.5y), 

w3(x, y) = (0.5x, 0.5y)}. 
The attractor of the IFS is a Sierpinski triangle A with vertices at (0,0), (0, 1), 
and (1,0). The relationship between the dynamical system {R7; f} and the IFS 
{R?; w;, w2, w3} is that {A; f} is a shift dynamical system associated with the IFS. 
(Shift dynamical systems are discussed in Chapter IV, section 4.) One readily verifies 
that f restricted to A satisfies 


w; (x,y) if (x, y) € w:(A) \ {(0, 0.5), (0.5, 0.5)}, 
f(x, y)=4 wy'(x, y)_ if (x, y) € wa(A) \ {(0.5, 0)}, 
w; (x, y) if (x, y) € w3(A). 


In particular, f maps A onto itself. The dynamical system {R?; f} is an exten- 
sion of the shift dynamical system {A; f} to R*. The situation is illustrated in Fig- 
ure VII.189. 

Let d denote the Euclidean metric on R*. The shift dynamical system {R?; f} 
is “expanding”: for any pair of points x;, x2 lying in any one of the three domains 
associated with f, we have 


d(f (x1), f (x2)) = 2d (x1, x2), 
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In this region f(xy)=w7 (xy) 


S) x hs x 
NWN 


NWS : 


MN 


Here f(x,y) =w3 (xy) In this region f(x,y) =w5! (xy) 


One can prove that the orbit {f°"(x)}°°, diverges toward infinity if x does not 
belong to A. That is 


d(O, f°"(x)) > oo as n — oo for any point x € R? \ A. 


What happens if we compute numerically the orbit of a point x € A? Recall that 
the fractal dimension of A is log(3)/ log(2). This tells us that A is “thin” compared 
to R*. Hence, although f(A) = A, errors in a computed orbit are likely to produce 
points that do not lie on A. This means that, in practice, most numerically computed 
orbits will diverge, regardless of whether or not the initial point lies on A. The 
Sierpinski triangle A is an “unstable” invariant set for the transformation f : R? > 
R2. It is a “repulsive” fixed point for the transformation f : H(R*) > H(R?). It is 
an attractive fixed point for the transformation W : 71(IR*) > (IR), where W = 
w, U w2 U w; is defined in the usual manner. 


Figure VII.189. The 
dynamical system {R?, f} 
is obtained by extending 
the definition of a shift 
dynamical system on a 
Sierpinski triangle to all 
of R?. 
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Figure VII.190. How 
long do orbits of points in 
W take to arrive in V? We 
expect that the number of 
iterations required should 
tell us something about the 
structure of A. 


Intuitively, we expect that orbits of the dynamical system {R*; f} that start close 
to A should “take longer to diverge” than those that start far from A. How fast do 
different orbits diverge? Here we describe a numerical, computergraphical experi- 
ment to compare the number of iterations required for the orbits of different points 
to escape from a ball of large radius, centered at the origin. Let (a, b) and (c, d), re- 
spectively, denote the coordinates of the lower left corner and the upper right corner 
of a closed, filled rectangle W C R*. Let M denote a positive integer, and define an 
array of points in W by 

(c — a) (d —b) 

re b+q —— 

In the experiment these points will be represented by pixels on a computer graphics 
display device. We compare the orbits {f°"(xp,q) :}?2) for p,q =0,1,2,..., M. 

Let R be a positive number, sufficiently large that the ball with center at the origin 
and radius R contains both A and W. Define 


V={(x, y) €R:x? + y* > R}. 


Xpg =(at+p ior p, ag = 0, 1,2, ...,M. 


A possible choice for the rectangle W and the set Y, in relation to A, is illustrated 
in Figure VII.190. In order that the comparison of orbits provides information about 
A, one should choose W so that WN A+ @. 

Let numits denote a positive integer. The following program computes a finite set 
of points 


(F°° (pg) F°p.g)s £° Xpag)s +--+ fF" (%p.g)} 
belonging to the orbit of x», € W, for each p,q =1,2,..., M. The total number 
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of points computed on an orbit is at most numits. If the set of computed points of the 
orbit of x,,, does not include a point in V when n =numits, then the computation 
passes to the next value of (p, q). Otherwise the pixel corresponding to x, q is ren- 
dered in a color indexed by the first integer n such that f°"(x,,,) € V, and then the 
computation passes to the next value of (p, g). This provides a computergraphical 
method for comparing how long the orbits of different points in W take to reach V. 

The program is written in BASIC. It runs without modification on an IBM PC 
with Enhanced Graphics Adaptor and Turbobasic. On any line the words preceded 
by a’ are comments: they are not part of the program. 


Program 1. ((Example of the Escape Time Algorithm)) 
numits=20: a=0 : b=0: c=1 : d=i1 : M=100 ’Define viewing 


window, W, and numits. 


R=200 ’Define the region V. 

screen 9: cls ’Initialize graphics. 
for p=1i to M 

for q=1i to M 


x = a+ (c-a)*p/M : y = b + (d-b)*q/M ’Specify the initial 


point of an orbit, x(p,q). 


for n=1 to numits ’Compute at most numits points on the orbit 
of x(p,q). 


if y > 0.5 then ’Evaluate $f$ applied to the 
previous point on the orbit. 


% @ 2% 3 y *@ oy 4 
elseif x > 0.5 then 
?THE FORMULA FOR THE 
x = 2x - 1: y = 2*y FUNCTION f(x) 
else 
x = 2*x : y = 2*y 
end if . 


150 if x*x + y*y > R then ?If the most recently computed 
point lies in )V then... 
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160 pset(p,q),n : n = numits ’...render the pixel x(p,q) 
in color n, and go to the next 


(p,q). 

170 end if 

if instat then end ’Stop computing if any key is pressed! 
next n : next q : next p 


end 


Color Plate 6 shows the result of running a version of Program 1 on a Masscomp 
5600 workstation with Aurora graphics. 

In Figure VII.191 we show the result of running a version of Program 1, but 
this time in black and white. A point is plotted in black if the number of iterations 
required to reach V is an odd integer, or if the orbit of the point does not reach V 
during the first numits iterations. 

In Figure VII.192 we show the result of running a version of Program 1, with 
(a, b) = (0, 0), (c, d) = (5 x 10718, 5 x 1078), and numits = 65. This viewing win- 
dow is minute. See also Color Plate 7. Now you should be convinced that A is not 
simplified by magnification. 

The dynamical system {R*; f} contains deep information about the “repelling” 
set A. Some of this information is revealed by means of the Escape Time Algorithm. 
The orbits of points that lie close to A do indeed appear to take longer to escape from 
R? \ V than those of points which lie further away. 


Examples & Exercises 


1.1. Let {R?, f} denote the dynamical system defined at the start of this chapter 
and let A denote the associated Sierpinski triangle. Prove that the orbit { f°"(x)} 25 
diverges, for each x € R?2 \ A. That is, prove that d(O, f°"(x)) > oo asn — ov for 
each x € R? \ A. 


1.2. Rewrite Program 1 in a form suitable for your own computergraphical envi- 
ronment, then run it and obtain hardcopy of the output. 


1.3. Ifthe Escape Time Algorithm is applied to the dynamical system 
{R?; f(x, y) = (2x, 2y)}, 
what will be the general appearance of resulting colored regions? 


1.4. By changing the window size in Program 1, obtain images of “zooms” 
on the Sierpinski triangle. For example, use the following windows: (0, 0) — 
(0.5, 0.5); (0, 0) — (0.25, 0.25); (0, 0) — (0.125, 0.125); .... How must the total 
number of iterations, numits, be adjusted as a function of window size in order 
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that (approximately) the quality of the images remains uniform? Make a graph of 
the total number of iterations against the window size. Is there a possible relation- 
ship between the behavior of numits as a function of window size, and the fractal 
dimension of the Sierpinski triangle? Make a hypothesis and test it experimentally. 


Here we construct another example of a dynamical system whose orbits “‘try to 
escape” from the attractor of an IFS. This time we treat an IFS whose attractor has a 
nonempty interior. Consider the hyperbolic IFS {IR?; w1, w2}, where 


H[3]-[~ o I5]+[0] 
mis tote % |[-12h1 


and s = 72. 


The attractor of this IFS is a closed, filled rectangle, which we denote here by 
m. This attractor is the union of two copies of itself, each scaled by a factor 1/ J/2, 
rotated about the origin anticlockwise through 90°, and then translated horizontally, 
one copy to the left and one to the right. The inverse transformations are 


“E)-[2 Jb} Lr] = 


Define f : R? > R? by 


wi (x, y) when x >0 
w5'(x,y) whenx <0. 


finn={ 


Then the dynamical system {R*; f} is an extension of the shift dynamical system 
{m; f} to R*. : 
What happens when we apply the Escape Time Algorithm to this dynamical 
system? To see, one can replace the function f(x) in Program 1 by 
if x > 0 then 


newx = s*y : newy = -s*x + s 
else 
newx = s*y : newy = -s*x - s 
>THE FORMULA FOR THE 
end if FUNCTION f(x) 


xX = newx : y = newy 
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Results of running Program 1, thus modified, with the window W and the escape 
region V chosen appropriately, are shown in Figure VII.193. 

It appears that the orbits of points in the interior m do not escape. This is not 
suprising. The fractal dimension of the attractor of the IFS is the same as the fractal 
dimension of R, so small computational errors are unlikely to knock the orbit off 
the invariant set. It also appears that the orbits of points that lie in R? \ m reach V 
after fewer and fewer iterations, the farther away from m they start. 

Again we see that the Escape Time Algorithm provides a means for the compu- 
tation of the attractor of an IFS. Indeed, we have here the bare bones of a new algo- 
rithm for computing images of the attractors of some hyperbolic IFS on R*. Here are 
the main steps. (a) Find a dynamical system {R*; {} which is an extension of a shift 
dynamical system associated with the IFS, and which tends to transform points off 
the attractor of the IFS to new points that are farther away from the attractor. (This 
is always possible if the IFS is totally disconnected. The tricky part is to find a for- 
mula for f(x), one which can be input conveniently into a computer. In the case of 
affine transformations in R*, one can often define the extensions of the domains of 
the inverse transformations with the aid of straight lines.) (b) Apply the Escape Time 
Algorithm, with V and W chosen appropriately, but plot only those points whose nu- 
merical orbits require sufficiently many iterations before they reach V. For example, 


Figure VII.193. An 
image of an IFS attrac- 
tor computed using the 
Escape Time Algorithm. 
This time the attractor of 
the IFS is a filled rectangle 
and the computed orbits of 
points in @ seem never to 
escape. 
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Figure VII.194. Im- 5 
ages of an IFS attractor 
computed using the Es- 
cape Time Algorithm. 
Only points whose orbits 
have not escaped from 
R \ V after numits iter- 
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ations are plotted. The es 
value for numits must be oes nee 
chosen not too large, as in ee : 


(a); and not too small, as 
in (b); but just right, as in 
(c). 


(c) 


in Program 1 as it stands, one can replace the three lines 150, 160, and 170 by the 
two lines 
150 if n = numits then pset(p,q),1 


160 if x*x + y*y > R then n = numits 
and define numits = 10. If the value of numits is too high, then very few points will 
not escape from W and a poor image of A will result. If the value of numits is too 
low, then a coarse image of the A will be produced. An image of an IFS attractor 
computed using the Escape Time Algorithm, modified as described here, is shown 
in Figure VII.194. 

Color Plates 8—12 show the results of applying the Escape Time Algorithm to the 
dynamical system associated with various hyperbolic IFS in R?. In each case the 
maps are affine, and the shift dynamical system associated with the IFS has been 
extended to R?. 
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Examples & Exercises 


1.5. Modify your version of Program 1 to compute images of the attractor of the 
IFS {C3 w)(z) = rez — 1, w2(z) = rez + 1}, when r = 1//2 and 6 = 77/2. 


1.6. Show that it is possible to define a dynamical system {C; f} which extends to 
C the shift dynamical system associated with the IFS 


{C; wy(z) = rez — 1, w2(z) = rez + 1}, 


for any 0 € [0, 277), provided that the positive real number r is chosen sufficiently 
small. Note that this can be done in such a way that f is continuous. 


1.7. Let {A; f} denote the shift dynamical system associated with a totally discon- 
nected hyperbolic IFS in R*. A denotes the attractor of the IFS. Show that there are 
many ways to define a dynamical system {R?; g} so that f(x) = g(x) forallx eA. 


The Escape Time Algorithm can be applied, often with interesting results, to any 
dynamical system of the form {R?; f},{C; f}, or {C ; J}. One needs only to specify 
a viewing window W and a region V, to which orbits of points in W might escape. 
The result will be a “picture” of W wherein the pixel corresponding to the point z is 
colored according to the smallest value of the positive integer n such that f°"(z) € V. 
A special color, such as black, may be reserved to represent points whose orbits do 
not reach V before (numits +1) iterations. 

What would happen if the Escape Time Algorithm were applied to the dynamical 
system f: C > C defined by f(z) =z*? This transformation can be expressed 
f (x, y) = (x? — y’, 2xy). From the discussion of the quadratic transformation in 
Chapter III, section 4, we know that the orbits of points in the complement of the 
unit disk F = {z €¢ C: |z| < 1} converge to the point at infinity. Orbits of points in 
the interior of F converge to the origin. So if W is a rectangle that contains F 
and if the radius R, which defines VY, is sufficiently large, then we expect that the 
Escape Time Algorithm would yield pictures of F surrounded by concentric rings of 
different colors. The reader should verify this! 

F is called the filled Julia set associated with the polynomial transformation 
f(z) = 22. The boundary of F is called the Julia set of f, and we denote it by 
J. It consists of the circle of radius 1 centered at the origin. One can think of J 
on the Riemann Sphere as being represented by the equator on a globe. This Julia 
set separates those points whose orbits converge to the point at Infinity from those 
whose orbits converge to the origin. Orbits of points on J itself cannot escape, 
either to infinity or to the origin. In fact J € H(C) and f(J) = J = f7'(J). It is 
an “unstable” fixed point for the transformation f : H(C) > H(C). 

Definition 1.1 Let f :C > C denote a polynomial of degree greater than 1. 
Let F denote the set of points in C whose orbits do not converge to the point at 
infinity. That is, 


Fr ={zEC: {| f°" (z)|}pao is bounded }. 


i= 
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Figure VII.195. Illus- ais Ball around oO 
tration showing what is SPHERE o @) 


going on in the proof 
of Theorem 1.1. This il- 
lustrates the increasing 
sequence of sets {V,} of 
the point at infinity. It also 
shows the decreasing se- 
quence of sets, K,,, the 
complements of the lat- 
ter, which converge to the 
filled Julia set Fy. In gen- 
eral the origin, O, need 
not belong to F;. 


J is the 
boundary of F 


go to the North Pole - 
F 


~ 
ws 


This set is called the filled Julia set associated with the polynomial f . The boundary 
of Fr is called the Julia set of the polynomial f , and it is denoted by J. 


Theorem 1.1 Let f :C — C denote a polynomial of degree greater than 1. 
Let Fy denote the filled Julia set of f and let J denote the Julia set of f. Then Fs 
and J are nonempty compact subsets of C; that is, Ff € H(C) and Js € H(C). 
Moreover, f (Js) = Jz = f~\(Jp) and f (Fs) = Fr = f7'(Fy). The set Voo = C \ 
F ¢ is pathwise-connected. 


Proof We outline the proof for the one-parameter family of transformations f, : 
C — C defined by 


Alara, where A € C is the parameter. 


The general case is treated in [Blanchard 1984], [Brolin], [Fatou 1919-20], and 
[Julia 1918], for example. This outline proof is constructed to provide information 
about the relationship between the theorem and the Escape Time Algorithm. Some 
of the ideas and notation used here are illustrated in Figure VII.195. 

Let J, denote the Julia set for f, and let F, denote the filled Julia set for f,. Let d 
denote the Euclidean metric on C and let 


R>05+ J0.25 + |Al. 
Then it is readily verified that 
d(O, f(z)) > d(O, z) for all z such that d(O, z) > R. 
Define 
V={zeEC: |z| > R} U {oo}. 
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Then it follows that 
FY) CY. 


One can prove that the orbit { f°"(z)} converges to oo for all z € V. No bounded orbit 
intersects V. It follows that 


F,={zeCc: f°"(z) ¢ V for each finite positive integer n}. 


That is, F, is the same as the set of points whose orbits do not intersect V. 
Now consider the sequence of sets 


i ee eae 62 forn=0,1,2,.... 


For each nonnegative integer n, V, is an open connected subset of (C, Spherical). 
VY, is open because V is open and f is continuous. V,, is connected because of the 
geometry of the quadratic transformation, described in Chapter III, section 4: The 
inverse image of a path that joins the point at infinity to any other point on the sphere 
is a path that contains the point at infinity. 

Since f(V) C V it follows that V C f~!(V). This implies that 


VoMeg Vice GW: GY, Cee. (1) 


For each nonnegative integer n, 


Vn ={ze€: fz, £"@), £2@) F8@s + MOOV AGI. 


That is, V,, is the set of points whose orbits require at most n iterations to reach V. 
Let 


Kn =C\ Vp forn =0, 1,2,3,.... 


Then K,, is the set of points whose orbits do not intersect VY during the first n 
iterations. That is, 


K,= {z EC: {z, f(z), f(z), F(z)... F"™@INVE= @| 


For each nonnegative integer n, K,, is a nonempty compact subset of the metric space 
(C,,Spherical). How do we know that K,, is nonempty? Because we can calculate that 
f possesses a fixed point z; € C, by solving the equation 


f@p=%y-A=zy. 


The orbit of zs converges to z¢. Hence it cannot belong to V, for any nonnegative 
integer n. Hence zy € K,, for each nonnegative integer n. 
Equation 1 implies that 


| ees Tee y Cee Coe ee ee 


It follows that {K,,} is a Cauchy sequence in H (C). It follows that {K,,} converges to 
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a point in H(C ). The limit is the set of points whose orbits do not intersect V. Hence 


Fr= im KS tka: 


and we deduce that F belongs to H(C). 
The equation 


Katt = f° (RK) forn=0,1,2,... 
now implies, as in the proof of Theorem 4.1, that 
Fy = f°-"(Fi). 
Applying f to both sides of this equation, we obtain 
FCF) = Fi. 


Let us now consider the boundary of F,, namely the Julia set J, for the dynamical 
system {C; fy}. Let z €interior(F;). Then the continuity of f implies f Ser aes 
interior(F;,). Hence F, > f~!(@F,) D 8F,. Now suppose that z € f~'(9F,). Let O 
be any open ball that contains z. Since f is analytic, f(O) is an open ball, and it 
contains f(z) € 0F,. Hence f(©) contains a point whose orbit converges to the 
point at infinity. It follows that O contains a point whose orbit converges to the 
point at infinity. Thus f~'(9F,) C F,. We conclude that f~'(@F,) = 0F, and in 
particular that f(@F,) = 0F,. This completes the proof of the theorem. 


We summarize some of what we discovered in the course of this proof. The 
filled Julia set F, is the limit of a decreasing sequence of compact sets. Its comple- 
ment, which we denote by V., is the limit of an increasing sequence {V,,} of open 
pathwise-connected sets in (C, spherical). That is, 


Veo —_ lim Vn = 5 ae 
n—> oo 


The latter is called the basin of attraction of the point at infinity under the polyno- 
mial tranformation f,. It is connected because each of the sets V, is connected. We 
have 


Voo is open, connected, and nonempty. F, is compact and nonempty. 

The Escape Time Algorithm provides us with a means for “seeing” the filled Julia 
sets F,, as well as the sequences of sets {V,} and {K,,} referred to in the theorem. 
Let us look at what happens in the case A = 1.1. Define V by choosing R = 4, and 
let W = {(x, y): -2 <x <2, —2 < y < 2}. The function fj=1.1:C — C is given 
by the formula 


fai, y= a y? — 1.1, 2xy) for all (x, y) EC. 


An example of the result of running the Escape Time Algorithm, with V, W and 
f :C > C thus defined, is shown in Figure VII.196. The black object represents 
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the filled Julia set F,~).;. The contours separate the regions V,4; \ V,, for some 
sucessive values of n. These contours also represent the boundaries of the regions 
K,, referred to in the proof of the theorem. We refer to them as escape time contours. 
Points in V,4; \ V, have orbits that reach Y in exactly (n + 1) iterations. In Color 
Plate 13 we show another example of running the Escape Time Algorithm to produce 
an image of the same set. The regions V,,,; \ V, are represented by different colors. 

Figure VII.197 shows a zoom on an interesting piece of F,,~),, including parts of 
some escape time contours. This image was computed by choosing W to be a small 
rectangular subset of the window used in Figure VI1I.196. 

Figures VII.198(a)—(e) shows pictures of the filled Julia sets F, for a set of real 
values of A. These pictures also include a number of the escape time contours, to 
help indicate the location of F,,. Fo is a filled disk. As A increases, the set becomes 
more and more pinched together until, when A = 2, it is the closed interval [—2, 2]. 
For some values of A € [0, 2], it appears that F;, has no interior, and is “tree-like”; for 
other values it seems to possess a roomy interior. It also appears that F, is connected 
for all A € [0, 2], and totally disconnected when A > 2. In the latter case F, may be 
described as a “Cantor-like” set, or as a “dust.” The transition between the totally 
disconnected set and the connected, bubbly set as the parameter A is varied reminds 
us of the transition between the the Cantor set and the Sierpinski triangle, discussed 
in connection with Figure IV.118. 


Examples & Exercises 


1.8. Modify your version of the Escape Time Algorithm to allow you to com- 
pute pictures of filled Julia sets for the family of quadratic polynomials z* — A for 
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Figure VII.196. The 
Escape Time Algorithm 
provides us with a means 
for “seeing” the filled 
Julia sets F,, as well 

as the sequences of sets 
{V,} and {K,,} referred 
to in Theorem 1. In this 
illustration, A = 1.1. The 
black object represents 
the filled Julia set Fy.) 1. 
The contours separate the 
regions V,_; \ V,, for 
some successive values 
of n. These contours also 
represent the boundaries 
of the regions K,, referred 
to in the proof of the 


: theorem. 
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Figure VII.197. Zoom 
in on an interesting piece 
of Figure VII.196. 


complex values of A. Compute a picture of the filled Julia set for A =i and obtain 
hardcopy of the output. 


1.9. Give the iteration formulas, and find a suitable value for R in terms of |A|, so 
that the Escape Time Algorithm can be applied to the complex polynomial z>—A. 


1.10. Study web diagrams associated with x? — A, for increasing values of A € 
[0, 3]. Speculate on the relation of these diagrams to the corresponding filled Julia 
sets. 


1.11. Let A € [0, 0.7] U [0.8, 1.2]. Let V to be an open ball of radius 0.00001 
centered at the origin. Run the Escape Time Algorithm to the dynamical system 
{C, z* — A} with this choice of V. Obtain computergraphical data in support of the 
hypothesis that, in this case, the algorithm yields approximate pictures of pieces of 
the closure of C \ F,. Design an escape region Y so that, for A € [0, 0.7] U [0.8, 1.2], 
the Escape Time Algorithm yields approximate pictures of J). 


1.12. The Escape Time Algorithm introduces numerical errors in the computation 
of orbits. These errors should lead to inaccuracies in the computed pictures of Julia 
sets and IFS attractors. Consider the application to the filled Julia set for z* — 1. By 
means of computergraphical experiments, determine the importance of these errors 
in the images you compute. One way to proceed is to choose successively smaller 
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Figure VII.198. (a)-(e) 
A sequence of Julia set im- 
ages, as in Figure VII.196, 
for an increasing sequence 
of values A in the range 

0 to 3. In (d) and (e) the 
filled Julia set is the same 
as the Julia set: the filled 
Julia set has no interior, so 
it equals its boundary. In 
(d) the Julia set is “tree- 
like.” In (e) the Julia set is 
totally disconnected. 


Figure VII.198. (b) 
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Figure VII.198. (d) 


Figure VII.198. (c) 


1 The Escape Time Algorithm 263 


Figure VII.199. This 
image was computed 
by applying the Es- 
cape Time Algorithm 
to the dynamical sys- 
tem {C; f(z) =z*-z- 
0.78}. The viewing win- 
dow is W={(x, y): 
—1<x<1,-l<y<l}. 
Can you determine the 
escape region V? 


windows W, which intersect the apparent boundary of the filled Julia set, and to 
seek the window size at which the quality of computed images seems to deteriorate. 
(You will need to increase the maximum number of iterations, M, as you zoom.) 
Can you give evidence to show that the apparently deteriorated images are not, in 
fact, correct? 


1.13. Figure VII.199 was computed by applying the Escape Time Algorithm to the 
dynamical system {C; f(z) = z* — z — 0.78}. The viewing window is W = {(x, y): 
—1<x <1,—1<,y <1}. Determine the escape region V. Also, you might like to 
try magnifying one of the little faces in this image. 


1.14. The images in Figure VII.200 (a),(b),(c), and (d) represent the nontrivially 
distinct attractors of all IFS of the form 


{m; W1, W2, w3}, 


where the maps are similitudes of scaling factor one-half, and rotation angles in the 
set {0°, 90°, 180°, 270°}. The three translations (0, 0), (1, 0), and (0, 1) are used. 
These IFS are all just-touching. For i # j the set w;(A) Nw; (A) is contained in one 
of the two straight lines x = 1 or y = 1. Show that, as a result, it is easy to compute 
these images using the Escape Time Algorithm. 

Here are some observations about this “group” of images. Many of them contain 
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Figure VII.200. (a)- 
(d) The images in (a), 
(b), (c), and (d) represent 
the nontrivially distinct’ 
attractors of all IFS of 
the form {M; w,, w2, w3}, 
where the maps are simil- 
itudes of scaling factor 
one-half, and the rota- 
tion angles are in the 

set {0°, 90°, 180°, 270°}. 
The three translations 

(0, 0), (1,0), and (0, 1) 
are used. These IFS are all 
just-touching. For i 4 j 
the set w;(A) MN w;(A) is 
contained in one of the 
two straight lines x = 1 or 
y = 1. Hence it is easy to 
compute images of these 
attractors using the Escape 
Time Algorithm. 


Figure VII.200. (b) 
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Figure VII.200. (c) 


Figure VII.200. (d) 
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straight lines. They all have the same fractal dimension. They all use approximately 
the same amount of ink. Many of them are connected. Make some more observa- 
tions. Can you formalize and prove some of these observations? 


1.15. Verify computationally that a “snowflake” curve is a basin boundary for the 
dynamical system {R*; f}, where for all (x, y) € R?, 


f(x, y) = (0, —1) if y < 0; 

f (x, y) = 3x, 3y) if y>Oandx < Sy //3 + 1; 

f (x, y) =((9 — 3x — 3V3y)/2, 3V3 — 33x + 3)/2) 
ify>0O and —y/V¥3+1<x <3/2; 

f (x, y) = ((3x — 3V3y)/2, 3V3x + 3y — 6V3)/2)/2), 
ify>0O and 3/2<x <y/V3+42:; 

f(x, y) = (9 — 3x, 3y), if y>0, and x > y/V3 +2. 


Iterated Function Systems Whose Attractors Are Julia Sets 


In section 1 we learned how to define some IFS attractors and filled Julia sets with 
the aid of the Escape Time Algorithm applied to certain dynamical systems. In this 
section we explain how the Julia set of a quadratic transformation can be viewed as 
the attractor of a suitably defined IFS. 

The Escape Time Algorithm compares how fast different points in W escape to 
Y, under the action of a dynamical system. Which set repels the orbits? From where 
do the escaping orbits originate? In the case of the dynamical systems considered at 
the start of section 1, orbits were “escaping from” the attractor of the IFS. 

Let A € C be fixed. Which set repels the orbits, in the case of the dynamical sys- 
tem {C: f,(z) =z” — A}? To find out let us consider the inverse of f,(z). This is 
provided by a pair of functions, f~'(z) = (+. /z +A, —/z +A}, where, for exam- 
ple, the positive square root of a complex number is that complex root that lies on 
the nonnegative real axis or in the upper half plane. Explicitly, /z = /x; + ix: = 
(a(x1, x2), b(x1, X2)) with 


2 2 
XV 4+X5 +X 
a(x, X42) = {> Sa when x2 > 0, 
2 2 
Xe +5 +X 
a(x1,xX2)=- = when x2 < 0, 
x? 4x2 — xy 
30 


b(X1, x2) = 
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To find the “repelling” set, we must try to run the dynamical system backwards. This 
leads us to study the IFS 


(C; wi(z) = Vz +A, w2(z) = —Vz $A}. 


The natural idea is that this IFS has an attractor. This attractor is the set from which 
points try to flee, under the action of the dynamical system {C; 2° =X}: 

A few computergraphical experiments quickly suggest a wonderful idea: they 
suggest that the the IFS indeed possesses an attractor, namely the Julia set J, = 
dF, for f,(z). Consider, for example, the case A = 1. Figure VII.201 (a) illustrates 
points in the window W = {z= (x, y) € C:—-2 <x <2, —2 < y < 2} whose orbits 
diverge. It was computed using the Escape Time Algorithm. Figure VII.201 (b) 
shows the results of applying the Random Iteration Algorithm to the above IFS, with 
). = 1 and the same screen coordinates, superimposed on (a). The boundary of the 
region F,; is outlined by points on the attractor of the IFS. 

Figures VII.202(a)-(d) show the results of applying the Random Iteration Algo- 
rithm to the IFS {C; wi(z) =VZ+A, w2(z) = —Vz+A} for various A € [0, 3]. In 
all cases it appears that the IFS possesses an attractor, and this attractor is the Julia 
set J. 

Perhaps 


{C; w)(z) = Vz +A, wo(z) = —Vz +A} 


is a hyperbolic IFS with J, as its attractor? No, it is not, because € = w,(C) U 
w>(C). The IFS is not associated with a unique fixed point in the space H(C). In 
order to make the IFS have a unique attractor, we need to remove some pieces from 
C, to produce a smaller space on which the IFS acts. 


Figure VII.201. (a) 
and (b) The attractor 

of the IFS {C; w;(z) = 
JV/z+1, w2(z) = -—Vz+ 1} 
is the Julia set for the 
transformation f(z) = 

z2 — 1. (a) illustrates 
points whose orbits “es- 
cape” when the Escape 
Time Algorithm is ap- 
plied. (b) shows the results 
of applying the Random 
Iteration Algorithm to the 
IFS, superimposed on (a). 
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Figure VII.201. (b) 


Theorem 2.1 Let i € C. Suppose that the dynamical system {C; f(j=2—-A} 
possesses an attractive cycle {2}, 22, 23,.-., Zp} C C. Let € be a very small positive 


number. Let X denote the Riemann Sphere C with (p + 1) open balls of radius € 
removed. (The radius is measured using the spherical metric.) One ball is centered at 
each point of the cycle, and one ball is centered at the point at infinity, as illustrated 
in Figure VII.203. Define an IFS by 


{X; wi(z) =VZ+A, wo(z) = —V z+ A}. 
Then the transformation W on H(X), defined by 
W(B) = w)(B) U w2(B) for all B € H(X), 


maps H(X) into itself, continuously with respect to the Hausdorff metric on H(X). 

Moreover W : H(X) — H(X) possesses a unique fixed point, J, the Julia set for 
2 

z- — A. Also 


lim: W°"(B) = J, for all B € H(X). 


These conclusions also hold if the orbit of the origin, { f°"(O)}, converges to the 
point at infinity, and X = C \ B(oo, €). 


Sketch of proof The fact that W takes H(X) continuously into itself follows 
from Theorem 4.1. To apply Theorem 4.1, three conditions must be met. These 
conditions are (i), (ii) and (iii), stated next. f is analytic on C so (i) it is continuous, 
and (ii) it maps open sets to open sets. The way in which X is constructed ensures 
that, for small enough e, (iii) f(X) D X. (The latter implies W(X) = f~!(X) c X.) 

To prove that W possesses a unique fixed point we again make use of Theo- 
rem 4.1. Consider the limit A € 7((X) of the decreasing sequence of sets, {W°"(X)}, 
namely, 

A=N&, f° (X) = Jim W°"(X). 
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Figure VII.202. (a) 
(d) The results of apply- 
ing the Random Itera- 
tion Algorithm to the IFS 
(C: Jr ti, Jet) 
for various values of 

d € [0, 3]. Compare these 
images with those in Fig- 
ure VII.198. The results 
are pictures of the Julia 
set for f,(z) =z? —A. The 
connection between these 
Julia sets and IFS theory is 
revealed! 


Figure VII.202. (b) 


Figure VII.202. (c) 


270 + # Chapter VII 


Figure VII.202. 


(d) 


Riemann Sphere C with a 


Julia Sets 


one Settee 


Figure VII.203. The Infinity 


number of very small open 
balls of radius € removed. 
One ball is centered at 
each of the points {z, € C} 
belonging to an attractive 
cycle of the transformation 
f,(z) = 27 — A. One ball 
is centered at the point at 
infinity. 


This obeys W(A) = A. It follows from [Brolin 65], Lemma 6.3, that A = J,, the 
Julia set. This completes the sketch of the proof. 


Theorem 2.1 can be generalized to apply to polynomial tranformations f : 
CC of degree N greater than 1. Here is a rough description: let f~!(z) = 
{w1(z), w2(z), ..., Wn(z)} denote a definition of branches of the inverse of f. Then 
consider the IFS {C: w1(Z), W2(Z), ..., Wy(Z)}. This IFS is not hyperbolic: the “typ- 
ical” situation is that the associated operator W : H(O) > H(©) possesses a finite 
number of fixed points, all except one of which are “unstable.” The one “stable” 
fixed point is Jr, and W°"(A) — Jy for “most” A € H(C). In principle, J can be 
computed using the Random Iteration Algorithm. 
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Results like Theorem 2.1 are concerned with what are known as hyperbolic Julia 
sets. The Julia set of a rational transformation f : C > Cis hyperbolic if, whenever 
c € C is a critical point of f, the orbit of c converges to an attractive cycle of f. 
The Julia set for z7 — 0.75 is an example of a nonhyperbolic Julia set. We refer to 
[Peitgen 1986] as a good source of further information about Julia sets from the 
dynamical systems’ point of view. 

Explicit formulas for the inverse maps, {w,(z):n =1,2,..., N}, for a polyno- 
mial of degree N, are not generally available. So the Random Iteration Algorithm 
cannot usually be applied. Pictures of Julia sets and filled Julia sets are often com- 
puted with the aid of the Escape Time Algorithm. The case of quadratic transfor- 
mations is somewhat special, because both algorithms can be used. The Random 
Iteration Algorithm can also be applied to compute Julia sets of cubic and quar- 
tic polynomials, and of special polynomials of higher degree such as 2" + A where 
n=5,6,7,...,andda €C. 


Examples & Exercises 


2.1. Consider the dynamical system (C; f(z) = 27}. The origin, O, is an at- 
tractive cycle of period 1: indeed f(O) = O and | i '(O)| =0 < 1. Notice that 
LiMn +00 f(z) = O for all z € B(O, 0. 99999999). Let B (z, r) denote the open ball 
on C, with center at z and radius r. Theorem 2.1 tells us that the IFS 


{X = € \ {B (O, 0.0000001)U B (00, 0.0000001)}; wi (z) = /z,w2(z) = — v2} 


possesses a unique attractor. The attractor is actually the circle of radius 1 centered at 
the origin. It can be computed by means of the Random Iteration Algorithm. Notice 
that if we extend the space X to include v, then O € H(X) and O= W(O) = 
w1(O) U w2(O). If we extend X to include B (O, 0.0000001), then the filled Julia 
set Fo belongs to H(X) and obeys Fo = W(Fo). If we take X to be all of C then 
C= w(C). In other words, if the space on which the IFS acts is too large then 
uniqueness of the “attractor” of the IFS is lost. 

Can you find two more nonempty compact subsets of C that are fixed points of 
W, in the case X = C? 

Establish that, for all A € (—0.25, 0.75), the point z = 0.5 — /0.25 + A is an 
attractive cycle of period 1 for (C; z* — A}. Deduce that the corresponding IFS, 
acting on a suitably chosen space X, possesses a unique attractor. 


2.2. Let 4 € (0.75, 1.25). Consider the dynamical system {C; f(z) =z” — A}. Let 
z1,Z) € R denote the two solutions of the equation z” + z + (1 — A) = 0. Show 
f(z) = 2, f (2) = 215 CF) (zd = CF )(@2)| < 1 and hence that {z1, z2} is an 
attractive cycle of period 2. Deduce that the IFS 
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(C\ {B (21, €)UB (22, €)U B (00, €)}; +Vz +4, —Vz FA} 
possesses a unique attractor when € is sufficiently small. 


2.3. The Julia set J, for the polynomial z* — A is a union of two “copies” of 
itself. Identify these two copies for various values of 1. Explain how, when A = 1, 
the two inverse maps w; '(z) and w, '(z) rip the Julia set apart, and the set map 
W = w, U w> puts it back together again. Where is the rip? Describe the geometry 
of what is going on here. 


2.4. Consider the one-parameter family of polynomials f(z) =z? — A, where A € 
C is the parameter. Give explicit formulas for the real and imaginary parts of three 
inverse functions w;(z), w2(z), and w3(z) such that f—!(z) = {w (z), w2(z), w3(z)} 
for all 4 ¢ C. Compute images of the filled Julia set for f(z) for 1 = 0.01 and 
A = 1. Compare these images with those obtained by applying the Random Iteration 
Algorithm to the IFS {C; w;(z), w2(z), w3(z)}. 


2.5. Consider the dynamical system {C; f(z) = z2 — A} for 4 > 2. Show that 
{ f°”(O)} converges to the point at infinity. Deduce that the IFS 


{X = C \ B(oo, €); +Vz +A, -Vz +A} 


possesses a unique attractor A(A). A(A) is a generalized Cantor set. Compute some 
pictures of A(3). Use the Collage Theorem to help find a pair of affine transforma- 
tions w; : R > R,i = 1, 2, such that the attractor of the IFS {R; w;, w2} is an ap- 
proximation to A(3). Define f : R > R by f (x) = w,'(x) when x <Oand f(x) = 
Wo '(x) when x > 0. Compare the graphs of the functions f(x) = x” — 3 and f(x) 
for, say, x € [—4, 4]. Compare one-dimensional “images” obtained by applying the 
Escape Time Algorithm in a similar manner to both {R; f} and {R; f}. 


One can sometimes obtain a hyperbolic IFS associated with a Julia set, if the do- 
mains and ranges of the inverse transformations are defined carefully. The following 
theorem provides such an example. 


Theorem 2.2 Let d € [—0.249, 0.749], and let « be a very small positive num- 
ber. Let a =0.5 — /0.25 +4, an attractive fixed point of the dynamical system 
{C; f(z) =z? —A). Let X=C \ (B (a, €)U B (oo, €) U (0, co)}. That is, X consists 
of the Riemann Sphere with a small open ball centered at a, a small open ball cen- 
tered at 00, and the open interval (0, 00), removed. (This space is not compact be- 
cause the edges of the lips of the cut, from O to oo, are missing.) To each lip let there 
be attached copies of the pieces of the real interval (0, 00) which were removed, to 
provide a compact space X, as illustrated in Figure VII.204. The distance d(z, z2) 
between a pair of points z, and z) € X is the length (measured using the spherical 
metric) of the shortest path that lies in X and connects z, to 22. (Paths in X cannot 
cross the cut, they have to go around it.) (X, d) is a compact metric space. 

Define w,: X > X by w\(z) = Vz +A, the root that lies in the “upper half 
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THE RIEMANN SPHERE 
AFTER SURGERY 


“UPPER HALF PLANE” 


Both edges of the 


small ball centered cut are copies of [0,9] 


at a removed 


—Infinity 


Cut along the 
positive real axis 
with two lips. 


“LOWER HALF PLANE” 


plane.” For z on the upper edge of the cut, w,(z) is also on the upper edge. For z 
on the lower edge of the cut, w;(z) lies on the negative real axis. Define w2 : X > X 
by w2(z) = —V/z +A, the root that lies in the “lower half plane.” For z on the the 
upper edge of the cut, w2(z) lies on the negative real axis; and, for z located on the 
lower edge of the cut, w2(z) lies on the lower edge of the cut. 

Then there is a metric on X, equivalent to the metric d, such that the IFS 
{X; w1, w2} is hyperbolic. The attractor is the Julia set J, for z* — i, where the 
real point 0.5 + /0.25 + A is repeated on both the upper and the lower edges of the 
cut. 


Sketch of proof Let e = (e), €2,..., én, ...) € Z, the code space on the two 
symbols {1, 2}. Define a sequence of nonempty compact subsets of X by 


Xp (€) = We, O We, O We; O We, O We, O We, 0+ 0 We, (X) forn = 1, 2, 3,.... 


It follows, using [Brolin], Theorem 6.2, and Lemma 6.3, that the sequence {X, € 
H(X)} converges to a singleton, say {p(e)}, where @(e) € J, and 


Ucexh(e) = Jh. 


A beautiful theorem of Elton [Elton 1988] applies under just these conditions and 
provides the conclusion of the theorem. This completes the outline of the proof. 


In those situations where the IFS {X; +/z + A, —Vz + A} is hyperbolic one can 
use the associated code space to discuss both the Julia set and the associated shift 
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Figure VII.204. Con- 
struction of a compact 
metric space X for IFS 
{X;/z +A, —J/z +A} 
with A € (—0.25, 0.75), 
used in Theorem 2.2. 

Both sides of the “slit” 
from —A to infinity belong 
to the space. The distance 
between a pair of points 
on X is the length of the 
shortest path that connects 
the points without cross- 
ing the slit. The distance 
between points may be 
much greater than it looks. 
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dynamical system {J,; f(z) =z” — A}. Here we give some of the flavor of such a 
discussion. More details can be found in [Barnsley 1984]. 

For the remainder of this section let A € (—0.25, 0.75) and consider the IFS 
{X; w1(z) = /z +A, wo(z) = —JV/z +A}, as defined in Theorem 2.2. Let © de- 
note the code space on the two symbols {1,2}, and let @: & — J, denote the 
associated code space map, introduced in Theorem 2.1 in Chapter IV. If e= 
(€1, €2,.-..,@n,---) € b, then 


o(e) = im: We, O We, O We, O We, O We, O We, O+** O We (Z). 


Replace the symbol “1” by the symbol “+” and replace the symbol “2” by the 
symbol ““—”. Then the point @(e) on the Julia set J, can be represented by the 


formula 
Ae5 Aeoy e7 eg oe CnV See 


where e; € {+, —} for each positive integer i. The set J, itself can be represented by 
the collection of formulas 


+ JAt [A+ JAtJaAtyatyatyat--tva--., (1) 


where all possible sequences of plus and minus signs are permitted. A particular 
sequence of signs, corresponding to a point in J,, is an address of the point. 

In Figure VII.205 we show the Julia set for z* — 0.7, with the addresses of various 
points marked on it. Some points on Jo,7 have multiple addresses while others have 
single addresses. It appears that the the IFS is just-touching. 

The shift dynamical system associated with the IFS is {J,; f(z) = z* — A}. Notice 
how the set of points represented by the formulas in equation 1 is mapped into itself 
by the function that “squares” a formula and subtracts A from the result. A point on 
a cycle of period 2 is represented by 


~~ 


~ The other point on this cycle is obtained by squaring the formula and subtracting i. 


In Theorem 2.4 in Chapter IV, we learned that the set of periodic points of the 
shift dynamical system associated with a hyperbolic IFS is dense in the attractor of 
the IFS. Here this tells us that the set of periodic points of the dynamical system 
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The “positive” root lies 
in the upper half plane 


The branch cut is on 


le of peri ~~ 
cycle of period 2 the positive real axis. 


Figure VII.205. The Julia set for z* — 0.7, labelled with various addresses. Chaotic 
dynamics takes place on the Julia set and orderly dynamics takes place off it. Boundaries 
of a fractal character often separate regions where the dynamical system behaves differently. 
The behavior of the dynamical system on such a boundary may then be indecisive, and in 
some way chaotic. 


{J,; f(z) =2* — A} is dense in J,. In fact, a related idea was the starting point of Ju- 
lia’s original investigations. He considered dynamical systems of the form {C; Ft (z)}, 
where f(z) is analytic. He defined the (Julia) set to be the closure of the set of repul- 
sive cycles of f. 

Following Theorem 8.1 in Chapter IV we explained the sense in which the shift 
dynamical system associated with a hyperbolic IFS is chaotic. In the present context 
we learn that the dynamical system {J,; z” — A} is chaotic. 

One can think of the dynamical system {C; z? — A} as being the union of two 
dynamical systems, a chaotic one {J4; z? — A} and an orderly one {C \ Ay 2 
4}. The orbit of any point in the latter system converges to a fixed point of the 
transformation. The orbits of “most” points in the former system are wild. In practice 
they are usually so wild they cannot be constrained to remain on the repelling set J. 
They escape and thereafter behave in a rather predictable manner. 

An example of chaotic dynamics on a Julia set is provided by the dynamical 
system {[0, 1]; f(x) =4x(1 — x)}. The interval (0, 1] is exactly the Julia set for the 
transformation. This system is close to the “chaotic” one illustrated in Figure IV. 108. 
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Examples & Exercises 


2.6. The Julia set for z? — 2 is the interval [—2, 2]. Show that the shift dynamical 
system associated with the IFS {{—2, 2]; +./z + 2, — Vz + 2} is precisely the dy- 
namical system {[—2, 2]; z — 2}. Use a chain of square roots to locate a cycle of 
minimal period 3. 


2.7. Verify numerically that for various choices of + on each Square root, and 
for various complex numbers A such that |A| is very small, the expression below 
evaluates approximately to a complex number that lies on the unit circle centered 
at the origin, if enough square roots are taken. (+./z means the solution, w, of the 
equation w? = z, which lies either on the nonnegative real axis or in the upper half 
plane.) Make a hand-waving explanation of why this is, in terms of Julia set theory. 


2.8. Design an IFS with condensation such that its attractor looks like an infinite 
nested chain of square root signs. 


2.9. Figure VII.206 represents a sequence of sets {A,} which converges to the 
Julia set of f(z) =z” — 1. Ao denotes the union of the two largest faces and A, = 
f°” (Ap). Identify the set Ap. 


3 The Application of Julia Set Theory to Newton’s Method 


We are familiar, since our first course in calculus, with Newton’s method for com- 
puting solutions of the equation F (x) = 0. Or are we? 

Consider the polynomial F(z) = z+ — 1 for z € C. There are four distinct com- 
plex numbers, a; (i = 1, 2, 3, 4) such that F(a;) = 0. These are called the roots, or 
the zeros, of the polynomial F(z). Newton’s method provides a means to compute 
them. Pretend that we do not know that a, = 1, a, = —1, a3 =1, and ag = —i. Then 
Newton tells us to consider the dynamical system 

A F(z) 

{C; f(z) =z FQ) }. 
We call f(z) the Newton transformation associated with the function F(z). The 
general expectation is that a typical orbit { f°"(zp)}, which starts from an initial 
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Figure VII.206. Let 
7: C > C denote the 
polynomial z? — 1. Let 
Ao be the union of the 
two largest smiley faces. 
Define a transforma- | 
tion W : H(C) > H(C) 
by W(B) = f~'(B) for 
all B € H(C) and let 
A, = W°"(Ao) for n = 
0,1,2,.... Then the se- 
quence of sets {A,} con- 
verges to the Julia set 
z? — 1. Can you identify 
A2? 


“guess” Z € C, will converge to one of the roots of F(z). In the present example, 
the Newton transformation is given by 
324 +1 
2). 
fO= 
We expect the orbit of zo to converge to one of the numbers a), a2, 43, or a4. If we 
choose Zp close enough to a;, then it is readily proved that 


lim f°" (zo) = di, fori =-1, 2, 3, 4. 
n—> co 


If, on the other hand, Zp is far away from all of the a;’s, then what happens? Perhaps 
the orbit of z) converges to the root of F(z) closest to 29? Or perhaps the orbit does 
not settle down, but wanders, hopelessly, forever? 

Let us make a computergraphical experiment to help answer these questions. 
We use the Escape Time Algorithm to produce a picture of those points zo € C 
whose orbits converge to a,. Define W = {(x, y)€ C: -2 <x <2,-2< y <2} 
and Y = {z € C: |z — a;| < 0.0001}. The real and imaginary parts of f(x + iy) are 
given by 


(cet+df) .(de-—cf) 


PRA DRE apy MA OE fy 
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where a = x” — y*, b = 2xy, c = 3a” — 3b* +1, d = 6ab, e = 4(xa — yb), and f = 
4(xb + ya). Program 7.1.1 is modified accordingly. Pixels corresponding to points 
in W whose orbits reach VY in less than a fixed number of iterations are plotted. 
A picture resulting from such an experiment is shown in Figure VII.207. See also 
Figures VII.208 and VII.209. 

Color Plate 14 shows the output from another such experiment. This time Mer- 
cator’s projection is used to represent the Riemann sphere, and points whose orbits 
converge to the different points a), a2, a3, and a4 are plotted in different colors. 

The following definition is equivalent to Definition 1.1 in the case of polynomials, 
[Brolin]. 


Definition 3.1 The Julia set of a rational function f : C > C, of degree greater 
than I, is the closure of the set of repulsive periodic points of the dynamical system 
(C; f). 

For the rational function f(z) considered above, one can prove that the Julia set J 
is the same as the set of points whose orbits do not conyerge to any one of the points 
Qa}, 42, a3, a4. In Figure VII.207, J MN W is represented by the boundary between 
the black and white regions. In Color Plate 14, J M W is the place where the four 
colors meet. The complement of the Julia set consists of four open sets, the basins 
of attraction of the four attractive fixed points of the Newton iteration scheme. In 
Color Plate 14 the dark green region represents part of the basin of attraction of a. 
The black regions in the color plate are caused by (a) rounding errors, and (b) the fact 
that only 100 points on each orbit are tested, for convergence to one of the points a, 
a2, A3, a4. 

The Julia set J is the part of C on which chaotic dynamics occurs. It can be 
characterized as the closure of the set of points whose orbits wander, hopelessly, 
forever. Orderly, slightly boring motion takes place on C \ J. J is the boundary of 
the white region. It is the boundary of the dark green region. It is a bonafide fractal, 
yet nobody knows its fractal dimension. 

There is a beautiful theorem of Sullivan, which can be illustrated using the 
“petals” in Color Plate 14. The complement of the Julia set is the union of a count- 
able collection of connected open sets, which we call petals. If P is a petal then 
f (P) is another petal. The Non-Wandering Domain Theorem [Sullivan 1982] says 
that no connected component of the complement of the Julia set wanders, hope- 
lessly, forever. It always settles into a periodic orbit of petals. If P is a petal in 
the present example, then one can prove that there is a positive integer S so that 
f°5(P) ice foS*D(P) ihn fOS*2)(P) am ; mane a —.... The final petal f°(P) is 
one of the connected components of the complement of the Julia set that contains 
one of the points a), a2, a3, a4. Each petal is eventually periodic. The orbit of a petal 
ends up in a cycle of petals of period 1. 

How are we to think about this fabulous Julia set? IFS theory provides a simple 
point of view, as we show next. We begin by defining the inverse map associated 
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Figure VII.207. The 
Escape Time Algorithm 
is applied to analyze 
Newton’s method for 
finding the complex roots 
of the polynomial z* — 1. 
The boundary of this 
region represents the 
Julia set for the rational 
function f(z) = (3z* + 
1)/4z*. The points plotted 
black are those points z = 
x +iy with —2 <x <2, 
and —2 < y < 2, whose 
orbits intersect V = {z € 
C:|z+1| < 0.0001} in 
less than 1,000 iterations. 


Figure VII.208. The 
boundary of this region 
represents the Julia set 
for the rational function 
f(z) = (3z* + 1)/4z3. The 
two shades of gray, black 
and white, correspond to 
the basins of attraction of 
the four attractive fixed 
points of f(z). To which 
point in C do the orbits of 
points in the white region 
converge? 
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Figure VII.209. Mer- 
cator’s projection of the 
Riemann sphere show- 
ing the basins of attrac- 
tion of the four attractive 
fixed points of the Newton 
transformation of z* — 1. 
The top of the rectangle 
corresponds to the point 
at infinity, and the bottom 
of the box corresponds to 
the origin. The points —1, 
+i, +1, and —1 lie on the 
equator. The shading fol- 
lows the same convention 
as in (a) and (b). Which 
point on C is represented 
by the midpoint of this 
image? 
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with f. Let ze C be given and solve 
| 3w* +1 
=, 
4w? 
to find w in terms of z. This leads to the quartic equation 
3w* — 4zw? +1=0. 


This has four solutions, when we count solutions according to their multiplici- 
ties. We can organize these solutions to provide four functions; that is, we write 
rs —l(z) = {w (z), w2(z), w3(z), w4(z)}. Then the Julia set is the “attractor” for the 
IFS {C: w;,i = 1, 2, 3, 4}. However, as in the case of quadratic transformations on 
C, this statement must be treated cautiously: for example, clearly this IFS admits 
more than one invariant set. 


Theorem 3.1 Let f: C — C be the Newton transformation associated with the 
4 


polynomial z* — 1. Let € > 0 be very small. Let X = C \Wrsy B (aj, €) where a, = 1, 
az = —1, a3 =1, and a4 = —i. As above, define W : H(X) > H(X) by 

W(B) =U;_,w;(B) = f-1(B) _ forall BE H(X). 
Then W is continuous, possesses a unique fixed point, J, the Julia set of f , and 


lim W""(B)=J for all B € H(X). 


3 The Application of Julia Sef Theory to Newton’s Method 


Sketch of proof This is essentially the same as the sketch of the proof of 
Theorem 2.1. 

The Newton transformation associated with a polynomial may possess an attrac- 
tive cycle of minimal period greater than 1. This cycle may not be directly related 
to the roots of the polynomial. As as example consider the Newton transformation 
f (z) associated with the polynomial 


F(zj=2+QA-Dz+1. 


dX € C can be chosen so that f(z) possesses an attractive cycle {b;, b2} of minimal 
period 2. Figure VII.210 illustrates the basin of attraction of the cycle. The Escape 
Time Algorithm was used to obtain this image. Points whose orbits arrive within a 
distance 0.01 of the cycle, prior to 100 iterations, are plotted in black. Accordingly, 
the escape region is V = B(b,, 0.00001) U B(b2, 0.00001). Notice the resemblance 
of the basin of attraction of {b;, b2} to the filled Julia set for z7 — 1. This similarity is 
not accidental. It can be explained using the theory of “polynomial-like” mappings 
[Douady 1985]. 

Some interesting computergraphical experiments involving Julia sets for New- 
ton’s method are described in [Curry 1983], [Peitgen 1986], and [Vrscay 1986]. 
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Figure VII.210. The 
Escape Time Algorithm is 
applied to a Newton trans- 
formation f(z) associated 
with a cubic polynomial. 
f (z) possesses an attrac- 
tive cycle {b,, b2} of mini- 
mal period 2. The basin of 
attraction of the two-cycle 
is represented in white. 
Points whose orbits arrive 
within a distance of 0.01 
of the cycle, prior to 100 
iterations, are plotted in 
white. Does the basin of 
attraction of the cycle look 
familiar? 
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Examples & Exercises 


3.1. Verify that z = 1 is an attractive fixed point for the Newton transformation 
associated with F(z) = z* — 1. 


3.2. The Newton transformation associated with the polynomial F(z) = z* + 1 is 


<e. 1 
fe=3 (2-5). 


Show the corresponding IFS is {C: wi(z) =ztv24+ 1, w2(z) =z — V 227 +1}, 
where the square root is defined appropriately. Verify that A = R U {00} is an attrac- 
tor of the IFS. Prove, or give evidence to show, that A is the Julia set for f(z). (Hint: 
see exercise 3.7.) How could the space C be modified so that the IFS has a unique 
attractor? Notice that numerically computed orbits of points on A, under the dynam- 
ical system {C; f}, can be constrained from escaping from A by keeping imaginary 
parts equal to zero. Verify numerically that the dynamics of {A; f} are wild. 


3.3. Find the Newton transformation f(z) associated with the polynomial F(z) = 
z> — 1. Use the Escape Time Algorithm to obtain an image, analagous to Fig- 
ure VII.207, which illustrates this Julia set. Discuss the dynamics of the “petals” 
in the image. 


3.4. In this example we speculate on the application of fractal geometry to bio- 
logical modelling. Let F,(z) = (z — iA)(z — 1)(z + 1), where A is a real parameter, 
and let f;,(z) denote the associated Newton transformation. Let J, denote the Julia 
set for f,(z). In Figure VII.211 we show images relating to J,, for an increasing se- 
quence of values of 4. These images were computed by applying the Escape Time 
Algorithm to f,. 

These images show complex blobs that are reminiscent of something small, bi- 
ological, and organic. They make one think of the nuclei of cells; of collections of 
cells during the early stages of development of an embryo; of the process of cell di- 
vision; and of protozoans. As we track the images we see that the blobs pass through 
one another. Somehow they do so while preserving their complex geometries. Their 
geometries seem to interact with one another. Such images suggest that fractal ge- 
ometry can do more than provide a means for modelling static biological structures, 
such as ferns: it appears feasible to construct deterministic fractal models, which 
describe the processes of physiological change that occur during the growth, meta- 
morphosis, and movement of living organisms. 


3.5. Find the Newton transformation f(z) associated with the function F(z) = 
e* — 1. What are the attractive fixed points of the dynamical system {C; f}? Fig- 
ure VII.212 was computed using the Escape Time Algorithm applied to f(z), with 
W = {(x, y) € R*: —2.5 < x <2.5, —2.5 < y < 2.5}. Describe the main features of 
the image. Explain, roughly, the causes of some of these features. 


3.6. What are the “petals” in the case of the Julia set for z* — 1? Use a picture of 
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Figure VII.211. (a) 
(h) Julia sets associated 
with a one-parameter fam- 
ily of dynamical systems. 
Can such systems be used 
to model biological pro- 
cesses such as meiosis? 


Figure VII.211. (b) 
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Figure VII.211. (c) 


Figure VII.211. (d) 
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Figure VII.211. () 


Figure VIL211. (f) 
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Figure VII.211. (g) 


Figure VII.211. (h) 
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Figure VII.212. This 
image was computed us- 
ing the Escape Time Algo- 
rithm applied to the New- 
ton transformation associ- 
ated with f(z) =e? — 1. 
The viewing window 

is W = {(x, y) € R*: 
—2.5<x <2.5,—-2.5< 
y < 2.5}. Can you work 
out what “escape region” 
was used? 


AN UT (CS, ‘ a as hed 
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the Julia set for z7 — 1 to illustrate the orbit of a tiny petal that is eventually periodic 
with minimal period 2. 

3.7. By making an explicit change of the coordinates, using a Mobius transforma- 
tion, show that the following two dynamical systems are equivalent: 


Pe 1 1 _ 
{G; fig= 5 (- )) and {C; f(z) =z}. 


4 ARich Source for Fractals: Invariant Sets of Continuous Open Mappings 


Let f be a transformation that acts on a space X. Recall that a set A is invariant under 
f if f-'(A) =A, and that this implies f(A) = A. We are interested in invariant sets 
of f that belong to #{(X). The following theorem is a theoretical tool that provides 
both the existence and a means for the computation of invariant sets. Most of the 
material in this chapter is based on it. 


Theorem 4.1 Let (Y,d) be a metric space. Let X C Y be compact and 
nonempty. Let f : X — Y be continuous and such that f (X) D X. Then, (1) a trans- 
formation W : H(X) — H(X) is defined by 
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W(A) = f/(A) for all A € H(X). 
(2) W possesses a fixed point A € H(X), given by 
A= Wo f"(X) = lim W(X). 


Suppose f obeys the additional condition that f (O) is an open subset of the metric 
space (f (X), d) whenever O Cc X is an open subset of the metric space (X, d). Then 
(3) W is a continuous transformation from the metric space (H(X), h(d)) into itself. 


Proof of (1) and (2). The proof of (3) can be found in (Barnsley 
1988c).. (1) We begin by proving that W maps 71((X) into H(X). Let B € H(X). 
The condition f(X) > X implies that f~'(B) C X, and that f~'(B) is nonempty. B 
is compact, so it is a closed set in the metric space (X, d). It follows that X \ B is 
open. The continuity of f implies that f —!(X \ B) is open. Since f(X) DX DB, it 
follows that f~'!(B) = X \ f~!(X \ B). Hence f—'(B) is closed in the metric space 
(X, d). Since X is compact it follows that f~'(B) is compact. This completes the 
proof of (1). 

(2) Since f (X) D X, it follows that X > f°(X). Application of f°”) to both 
sides of the latter equation yields 


X= pee = f°) (X) ms pe eo f°” (X) oe 


It follows that { f°‘-”)(X)} is a Cauchy sequence in H(X), and it possesses a limit 
A € H(X), given by 


AZT = lim W(X). 


It remains to be proved that A is a fixed point of f. We need to show that 
F°D(N An) = NS gAn where A, = f°” (X) for n = 1, 2,.... First we prove 
that f°) (N® An) D NXpAn. Suppose x € f° (N%A,). Then there is y € 
N°,A, such that x inf°(y), and y € A, for n =0,1,2,.... It follows that x € 
f°—)(A,) = Angi for n =0, 1, 2,.... It follows that x € N°9Az. To prove the in- 
clusion the other way around, suppose that x € N?°2)A,. Then x € An41 = f tA 
for n =0,1,2,3,.... It follows that there is y, € A, such that f(y,) =x forn = 
0, 1,2,3.... The sequence {y,} possesses a convergent subsequence. Let y denote 
the limit of this subsequence. Then y € A, forn =0,1,2,..., and so y EM oAn. 
Since f is continuous it follows that f(y) = x. Hence x € f° )(N%°)A,). We have 
shown that f°) (MN 4An) C M&pAn. This completes the proof of (2). 


The invariant set A referred to in Theorem 4.1 can be expressed 
A={xeX: f(x) €X forall n =1, 2, 3,...}. 


That is, A is the set of points whose orbits do not escape from X. Also, A is the 
complement of the set of points whose orbits do escape. If X C R?, then pictures of 
A can be computed by using the Escape Time Algorithm. 
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The last statement in the theorem expresses a desirable property for a transforma- 
tion W on 7{(X). If W is not continuous, yet Ao € 7{(X) and {W°"(Ao)} converges 
to A € H(X), one cannot conclude that W(A) = A. Without continuity of W, one 
should not trust the results of applying the Escape Time Algorithm. For example, 
slight numerical errors may mean that a computed sequence of sets {A, ~ W™" (X)} 
is not decreasing. One may still wish to define A=lim A, Without continuity one 
cannot suppose that f~!(A) = lim W(A,) = A, even approximately. 

Analytic transformations map open sets to open sets. Hence their inverses act 
continuously on the space 7#{(X), where X C C is chosen appropriately. To help 
vizualize this, look back at Figure III.52. If the Sierpinski triangle ABO is deformed 
or moved, its inverse image POQ U POQ will move continuously with it. 

The Hausdorff metric is the metric of perception: what we call a small change in 
the appearance of a picture is probably a small change in a Hausdorff distance. When 
one talks about continuous motion in the context of graphics, continuous growth in 
the context of botany, or continuous change in the context of a chemical system, 
the word “continuous” can often be replaced, pedantically, by “continuous in the 
Hausdorff metric.” Theorem 4.1 suggests that one could use continuous open maps 
to model such systems. 


Examples & Exercises 
*4.1. Let A € [—1, 1]. Define a transformation f : R* > R* by 


ty (x2—y?—1,2xy)  whenx > 0, 
oe (x? — y?-—1+Ax, 2xy) when x <0. 


Show that f is continuous. Show that if X denotes a ball, centered at the origin, of 
sufficiently large radius, then f(X) > X. Also verify that if A € [—1, 0] then f maps 
open sets into open sets. Show that this is not the case for 1 = 1. (Hint: look at what 
the map does to a very small disk centered at the origin.) 


Figure VII.213 shows the result of applying the Escape Time Algorithm to f 
when A = 1. The inner region, bounded by an ellipse, actually represents a disk 
X such that f(X) > X. Different scales have been used in the x and y directions. 
f (X) is the region bounded by the outer curve. The image of a point that goes once 
around the inner ellipse is a point that goes twice around the origin, following the 
outer curve, which looks like a folded figure eight. Different “escape times” of orbits 
of points in X are represented by different gray tones. A magnified version of X, 
painted by escape times, is shown in Figure VII.214. Roughly speaking, regions 
closest to the outside escape fastest. Points in the white region also escape. So 
where is the invariant set A? It is right in the middle. It appears to be a branching, 
connected, tree-like set, with no interior. 

Figure VII.215 shows the result of applying the Escape Time Algorithm to f 
when A = 0. This time we see that the invariant set A, in the center, in white, is 
just the filled Julia set for z? — 1. 
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Figure VII.213. The result of applying the Escape Time Algorithm to the function f 

in example 4.1 with A = 1. This function is continuous and is such that f(X) > X, where 
X denotes the Fabergé egg in the middle. f(X) is the region bounded by the outer curve. 
Different “escape times” of orbits of points in X are represented by different gray tones. 


What happens if we choose A = —1? This time we obtain the image shown in 
Figure VII.216. However, this time things may not be as simple as they appear to be. 
The inner “layers” that surround the apparent invariant set A are highly irregular and 
unstable. That is, points that are very close together seem to have orbits that have 
very different escape times. Could it be that, although the mapping W : H(X) > 
H(X) is continuous, it has a very poor modulus of continuity? 


4.2. Construct a function f :X — R’, where X C R’, which obeys the conditions 
of Theorem 4.1. Use the Escape Time Algorithm to analyze the associated invariant 
set A described in the statement of the theorem. Your example should be interesting 
and of a different character from those specifically described in this chapter. 
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A magnified version of the region X in Figure VII.213. Approximately, 


Figure VII.214. 
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The Idea of a Parameter Space: A Map of Fractals 


A map with nothing marked on it is practically useless. A map of a 1000 x 1000 
square mile region containing the British Isles is shown in Figure VIII.217; it does 
not convey much information. However, as a concept it is quite exciting. Each lo- 
cation on the map corresponds to somewhere on Earth. For example, the dot with 
coordinates (750, 227.3) represents the town of Maidstone. A point on the map may 
represent a certain grain of soil in a ploughed field, or a molecule of flotsam on the 
top of some foam on the surface of the sea. Nearby points in the map correspond 
to nearby points on the Earth. Connected sets with interiors correspond to physical 
regions. 

How could a perfect map be made? Ideally, it should specify locations on the 
Earth’s surface at a certain instant. The coordinates would be relative to some abso- 
lute coordinate system, perhaps determined by reference to the fixed stars. Moreover, 
the surface of the Earth would have to be defined precisely, up to the last molecule 
of water, soil, and plant matter; for this purpose one can imagine using a straight line 
from the center of the Earth as suggested in Figure VIII.218. Of course, maps are not 
made like this, but the goal is the same: to have an accurate correspondence between 
points on the physical surface of the Earth and the physical surface of the paper. 

We must be careful how we interpret a map. Geographical maps are complicated 
by the real number system and the unphysical notion of infinite divisibility. Mathe- 
matically, the map is an abstract place. A point on the map cannot represent a certain 
physical atom in the real world, not just because of inaccuracies in the map, but be- 
cause of the dual nature of matter: according to current theories one cannot know the 
exact location of an atom at a given instant. 

Fractal geometers avoid this problem by pretending that the surface of the Earth is 
an abstract place too; we imagine, once again, that matter is infinitely divisible, and 


1 The Idea of a Parameter Space 


Figure VIII.217. A 
map with no information 
marked on it is quite excit- 
ing. A very unhelpful map 
of the British Isles. The 
dot represents Maidstone. 
The map is accurate up to 
dot size. 


Maidstone 


1000 miles 


€-——---. 1 0G miles ——----—_> 


that we can address every point. In the same spirit, we presume that we can model 
trees, clouds, horizons, churning seas, and infinitely finely defined coastlines. Then, 
for example, we can define the Hausdorff-Besicovitch dimension of the coastline of 
the British Isles. . 

For a map to be useful it must have information marked on it, such as heights 
above sea level, population densities, roads, vegetation, rainfall, types of underlying 
rock, ownership, names, incidence of volcanoes, malarial infestation, and so on. 
A good way of providing such information is with colors. For example, if we use 
blue for water and green for land then we can “see” the land on the map and we 
can understand some geometrical relationships. We can estimate overland distances 
between points, land areas of islands, the shortest sea passage from LLanellian Bay 


: Figure VIII.218. How 
coordinates = Figure VIII.217 might 


specified by 
the fixed stars Dave been made. 


point on map 


Soint on Earth’s surface 


exact center of Earth 
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Figure VII.219. In 
this map points corre- 
sponding to land have 
been shaded. A fascinat- 
ing entity, the coastline, is 
revealed. 


1000 miles 


m——1000 miles ————_> 


to Amylwch Harbour, the length of the coastline, etc. All this is achieved through 
the device of marking some colors on a blank map! 

Let us consider the boundary of the shaded region in Figure VIII.219. It is here 
that the map conveys extra information. In the interior of the shaded region we learn 
no more about the surface of the Earth than that “there is land there.’ However, on 
the boundary we learn not only that “there are land and sea there,” but also, if the 
map is accurate enough, a feature that we will actually “see” on the surface, namely 
the local shape of the coastline. 

The latter idea can be extended. If we include more colors on a geographical map, 
to provide more information about properties of the Earth’s surface, we produce 
more boundaries on the map. These boundaries can give information about local 
geometry. For example, a map finely colored according to elevation reveals the 
shapes of the bases of the mountains, the paths of rivers, and — if we look closely 
enough—the outlines of buildings. Such a map, made abstract and perfect, placed in 
a metric space, would contain much detailed information about what, at each point, 
the local observer would see. 


Examples & Exercises 


1.1. Study an atlas that contains maps colored according to diverse criteria, such as 
rainfall, population density, vegetation, and elevation. Discuss to what extent these 
maps provide information about the local geometry of the surface of the Earth. 


We turn attention to making colored maps of parameterized families of fractals. 
We consider families of iterated function systems and families of dynamical systems 
that depend on two real parameters. The collection of possible parameter values 
defines a parameter space associated with the family. We use the notation P to 
denote a parameter space. Typically P is a subspace of (IR*,Euclidean) such as m, 
a closed ball, or R?. 

An example of a parameter space is P = {(A1, Az) € R*: |Aj|, |Az| < 2795}. This 
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is a parameter space for the family of hyperbolic IFS {C; (A; + iAz)z + 1, (Ay + 
iA2)z — 1}. Each point A = (A, A2) € P corresponds to an IFS. Each IFS possesses a 
unique attractor, say A(A). Hence each point of P corresponds to a single fractal. We 
can think of P as representing part of 71(C), a space of fractals. A map of P, with a 
few points marked on it, is shown in Figure VIII.220. Each point in P corresponds 
to a single fractal. Nearby points in P correspond to nearby fractals, that is, points 
in H(C) whose Hausdorff distance apart is small. 

Another example is P = C, which provides a parameter space for the family of 
dynamical systems {C; f,(z) = 2” — A}. Each point in the parameter space corre- 
sponds to a different dynamical system. Each dynamical system is associated with a 
unique Julia set, J(A). The collection of fractals {J (A) : A € P}, associated with the 
parameter space, is vast and diverse. 

Let X be a two-dimensional metric space such as R? or C. Let P denote a pa- 
rameter space corresponding to a family of fractals {A(A) € H(X) :A € P}. Can we 
provide the explorer of fractals, who wishes to investigate this collection of fractals, 
with a colored map? This map should give her information about the sets A(A) to be 
found at different points on P. 

Suppose P =m. To make a map, let us represent P by the set of pixels on the 
screen of a computer graphics monitor. The idea is to color the pixel 4 according 
to some property of A(A). (We write A = pixel = point in P, without repeatedly 
explaining that A is a point in the small rectangle in P that corresponds to the pixel.) 


Figure VIII.220. An 
example of a parameter 
space. Each point A in 
the space corresponds to 
a fractal, A(A). This is a 
poor map, because very 
little has been marked on 
it. It is like the map of 
the British Isles shown in 
Figure VIII.217. It needs 
coloring. 
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Suitable properties could relate to the connectivity of A(A), the fractal dimension of 
A(A), the escape time near a special point on A(A) under an associated dynamical 
system, the number of holes in A(A), or the presence of straight lines in A(A), for 
example. 

If we make a good selection of the properties to associate with colors, the result 
will be a useful map containing various differently colored regions. This map will be 
a ready reference for the explorer of fractals. It will tell him something about what 
to expect at he travels about P. He might be suprised nonetheless. 

The boundaries of the colored regions can provide additional geometrical infor- 
mation to the explorer, over and above the information that the map was originally 
designed to convey. It sometimes occurs that the local shapes of the boundaries in 
the map reflect the shapes of the corresponding fractals. There is a deep principle 
here, which we shall not pin down as a theorem, but which we will illustrate in a 
number of cases. 


Examples & Exercises 


1.2. It is often useful, when establishing the Lipshitz continuity condition in Theo- 
rem 11.1 in Chapter III, to remove dependencies on x € X by restricting the domain 
of the IFS to a compact set B that contains the attractors A(A) for all values of A € P. 
Since this does not change the attractors themselves, continuous dependency on A in 
B then guarantees it for the original IFS. This can be done in practice by means of 
the following lemma: 

Lemma 1.1 Let B € H(X) such that for all p € P, W(p, B) C B. Then for all 
pEP,A(p) cB. 

Prove the lemma. 
1.3. Let P = {(Aj, Az) € R?: [Aj], |Az| < 0.9}. The family of IFS {IR; A x, Asx + 
1 — Ao, 0.5x + 0.5} is hyperbolic, with contractivity factor s = 0.9, for all A € P. 
Use Theorem 3.11.1 and the lemma of the previous problem to prove that the attrac- 
tor A(A) depends continuously on A. 
1.4. The family of IFS {[0, 1]; Ayx?, A2x + (1 — A2)} is hyperbolic, with contractiv- 
ity factor s = 0.9, for all A in the parameter space 

P = {(Aj, Az) € R?:0 <A, <0.45,0 <A> < 0.9}. 
Since we have 
d(wy,(x), wi,(x)) = |x7|IA1, — Aig] < lai, — Aig! 
d(w2,(x), w,(x)) = |x — 1|]A2, — A2,| S |A2, — Aa, I, 

we can satisfy the Lipshitz condition with k = 1 in both cases. The attractor depends 
continuously on A. 


1.5. Sometimes the Lipshitz condition is not that convenient to prove directly. In 
this case, we can go directly to the continuity statement that it is used to guarantee. 
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Suppose we can show directly that w;(p, x) is a continuous function from P x X > 
X. That is, without holding either p or x fixed, given € > 0, there is a 6; > 0 for each 
w;i(p, x), such that : 


d(w;(p, x), wil(g, y) <€ whenever d((p,x), (g,x)) < §j. 


Then the conclusions of Theorem 11.1 in Chapter III hold. This does not mean 
that the Lipshitz condition (or something like it) is unnecessary; such conditions 
guarantee the statement we have just made. If we can guarantee it some other way, it 
is equivalent. 


1.6. An example of a parameter space is (P,Euclidean), where P = {(Aj, A2) € R?: 
|A1|, |A2| < 0.999}. This space can be used to represent the family of hyperbolic IFS 


{IR*; w1, w2}, where 
a) Xx ee d} 0 Xx 4 0 
<9 0 a as 
- PAO 02 x 
ah y P AOA AO Ny )* 
Since (using (z, w) = (A, Az) to emphasize familiarity) 


w(x, y, Z, W) = (zx, wy + 1) 


is continuous from basic calculus, Theorem 11.1 in Chapter III holds, and the attrac- 
tor varies continuously with A. 


2 Mandelbrot Sets for Pairs of Transformations 


Let P c R? be a parameter space corresponding to a family of fractals. That is, 
we have a function A: P — H(X), so that each point 4 € P corresponds to a set 
A(A) € H(X). One way to make a map is to color the parameter space according to 
whether or not A(A) is connected. 


Theorem 2.1 Let {X; w;, w2} be a hyperbolic IFS with attractor A. Let w; and 
w> be one-to-one on A. If 


w1(A)  w2(A) = %, 
then A is totally disconnected. If 

wi(A) NM w2(A) £9, 
then A is connected. 


Proof Suppose that w,(A) M w2(A) = @. Let & denote the code space map asso- 
ciated with the IFS. By Theorem 2.2 in Chapter IV the code space map @ : £ — A is 
invertible. ¢ is also a continuous transformation between two compact metric spaces. 
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Hence, by Theorem 8.5 in Chapter II, @ is a homeomorphism. Hence A is homeo- 
morphic to code space, which is totally disconnected. (Recall that code space on two 
or more symbols is metrically equivalent to a classical Cantor set.) It follows that A 
is totally disconnected. 

Suppose that w;(A) M w2(A) # Y. Then there is at least one point x € w,(A) NM 
w2(A). This point x has two addresses, say 


x= O(C) = O(o), where 2, = 1 and o,; = 2. 


Let us see what happens if we additionally suppose that “A is not connected.” Then, 
since A is compact, we can find two nonempty compact sets E and F so that 


Using compactness, there is a positive real number 4 so that 
d(e, f)=64 forallee E, f eF. 


Let a and wy be a pair of codes that agree through the first K symbols, for some 
positive integer K. That is 7; = W; fori =1,2,..., K. Then 


d(o(x), 6(W)) < s*diam(A), 


where diam(A) = Max{d(x, y): x, y € A}, ands € [0, 1) is acontractivity factor for 
the IFS. Suppose also that ¢(7) € E and @(y) € F. Then 


; 6<d(o(x), o(y)). 
Combining the latter two inequalities we discover 5 < s“diam(A), which implies 
log(6/diam(A)) 

log(s) 
We conclude that if e ¢ E and f ¢€ F then the number of successive agreements 
between an address of e and an address of f cannot exceed the number on the right- 
handside here. It follows that there is a maximum number, M, of initial agreements 
between the address of a point e € E and a point f € F; and this maximum is 


achieved on some pair of points, say e and f. Then we can find p; € {1,2} for 
i =1,2,..., M such that 


K < 


(01, 02, 03, +++) PMs l,...J=eeEE 


and 


PUP; Pr: Ps, <5 Pais So PT ee 
Now consider the point z € A, which has the two addresses 
4 = $(1, P2; P3,-++5 PMs L, o2, G3, b4, .- 3 
= $(p1, P2; P35 -++5 PM; 2, O94, OF; OAs» ‘di 


Suppose z € E. Then its address agrees with that of f € F through (M + 1) initial 
symbols. Hence z € F’. But then its address agrees with that of e € E through (M + 
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A IMAGINARY S Figure VIII.221. A 
: +. map of the family of IFS 
ol {C; Az — 1, Az + 1}, where 


the parameter space is 
P= fi .= {Ay, Ao) € C: 
|A| < 1}. This picture of 
a parameter space is ob- 
tained by “painting” black 
where the attractor of 
the IFS is disconnected 
and light where it is con- 
nected. The Mandelbrot 
set is the light region, the 
sea. It contains a dragon at 
A = (0.5, 0.5). 


1) initial symbols, which is not possible. We have a contradiction. Hence “A is 
not disconnected.” It follows that A is connected. This completes the proof of the 
theorem. 

Definition 2.1 Let {X; w;, w2} be a family of hyperbolic IFS that depends on a 
parameter i € P C R*. Let A(A) denote the attractor of the IFS. The set of points 
M C P defined by 


M = {iV € P: A(A) is connected} 

is called the Mandelbrot set for the family of IFS. 

For the rest of this section we consider the family of IFS 

{C; Az —1,Az+ ]}, 
where the parameter space is 
P={A= (Aj, An) €C 147 +23 < 1}. 

Figure VIII.221 shows a picture of the associated Mandelbrot set, M. This is a map 
for the collection of fractals associated with the IFS. It has been colored dark where 
the attractor is totally disconnected and light where it is connected. 

Here is an outline of an algorithm to compute images of the Mandelbot set M 


associated with the family {C; w;(z) = Az — 1, w2(z) =Az + 1}. It is based on The- 
orem 2.1. 
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Aigorithm 2.1 Example of Method for Making Pictures of the 
Mandelbrot Set of a Family of IFS. 


(1) Choose a positive integer, L, corresponding to the amount of computation one 
is able to do. The greater the value of L, the more accurate the resulting map 
image will be. 

(2) Represent the parameter space P = {X € C: |A| < 1} by an array of pixels. 
Carry out the following steps for each X in the array. 

(3) Calculate a number R, so that the attractor is contained in a ball of radius R, 
centered at the origin; that is, choose R > 0 so that A(A) C B(O, R). 

(4) Compute the number 


H = min{d(x, y): x € wi\(W ({0})), y € wo(W°“({0}))}, 


where W = w; U wp. If H < 2|A|4t!R, then the pixel is assumed to belong 
to M and is colored accordingly. 


Step (4) is based on the following observation. The attractor of the IFS is 
contained in the set W°“+")(B(O, R)), which consists of 24+ balls of radius 
|A|-+!R. The centers of these balls lie in the union of the two sets w;(W°“(0)) and 
w2(W°~(0)). If H is greater than 2\A|£*+'R, then A(A) must be disconnected. 


Figure VIII.222 shows the “coastal region” of a quarter of the complement of the 
Mandelbrot set in Figure VIII.221. It has been laid over a grid in order to help you 
locate points where interesting fractals lie. 

The boundary of M is complicated and intricate. Close-ups of the “coast- 
line” near the places marked (a), (b), and (c) are shown in Figure VIII.223. Fig- 
ure VIII.224 shows a zoom on the spiral peninsula in Figure VIII.222. 

Now look at Figures VIII.225 and VIII.226, which show pictures of the attractors 
A(A) for some points A located near the boundary of the Mandelbrot set. There is 
a “family resemblance” between the places on the boundary from which the fractals 
come, and the fractals themselves. To help see this you should look back at the close- 
ups on the coastline in Figure VIII.223. Figure VIII.227 shows the IFS attractor 
corresponding to the tip of the peninsula in Figure 224. Notice how it contains 
spirals, very much like the ones in the peninsula in parameter space. At the end of 
this chapter we make some comments on why such “family resemblances” occur. 

The following theorem provides rigorous bounds on the locations of M and aM. 
The proof is delightful, because it relies on a fractal dimension estimate. 


Theorem 2.2 [Barnsley 1985c] The attractor A(A) of the IFS {C; Az — 1,Az + 
1} is totally disconnected if || < 0.5 and connected if 1 > |A| > 1//2. The bound- 
ary of the associated Mandelbrot set is contained in the annulus 1/2 < |A| < 1/V2. 


Proof Let A denote the attractor of the IFS and let D(A) denote its fractal di- 
mension. The two maps in the IFS are similitudes of scaling factor |A|. This means 
that Theorem 2.3 in Chapter V can be applied. 
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HH © Figure VIII.222. This 
shows the coastal region 
of a quarter of the com- 
plement of the Mandelbrot 
set in Figure VIII.221. 
It has been laid over a 
grid in order to help you 
locate points where inter- 
esting fractals lie. Close- 
ups of the coast at (a), 
(b), and (c) are shown in 
Figure VIII.223. The co- 
ordinates of the grid are 
(0, 0) — —(0.71, 0.71). 


Suppose that A is totally disconnected. Then the IFS is totally disconnected and, 
by Theorem 2.3 in Chapter V, 


D(A) = log(1/2)/ log(|a\). 
By Theorem 2.1 in Chapter V, D(A) < 2. Hence 
log(1/2)/log(|A|) < 2. 


This implies that |A| < 1/ ft 

Suppose that A is connected. Then it contains a path that connects two dis- 
tinct points. The fractal dimension of any path is greater than or equal to 1. Hence 
D(A) > 1. However, by Theorem 2.3 in Chapter V, 


D(A) < log(1/2)/log(|A)). 
It follows that 
1 < log(1/2)/log(|A}). 
This implies that |A| > 1/2. This completes the proof of the theorem. 


A different point of view on the Mandelbrot set considered above is given by 
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Figure VIII.223. Close- (a) 
ups of the boundary of the 
Mandelbrot set at (a), 
(b), and (c). The diverse 
structures in this bound- 
ary echo the shapes of 
the attractors of the corre- 
sponding IFS. 


Figure VIII.228. M has been turned inside-out by making the change of variables 
d’ = )~'. The inner white disk is no-man’s land; it does not belong to the parameter 
space. Also included are the two bounds provided by Theorem 2.2, namely the circle 
|A’| = 2 and the circle |A’| = /2. The fractal dimension decreases with increasing 
distance from the origin. 


Examples & Exercises 


2.1. Sketch the Mandelbrot set for the family of IFS {R; Ayx + A2, A2x + A,} where 
the parameter space is P = {(Aj, Az): |Ay|, |A2| < 1}. 


2.2. Let {X; wi, w2} be a family of hyperbolic IFS which depends on a parameter 
4 € P CR’. Let w; and w2 depend Lipshitz continuously on A for some k > 0 and 
for fixed x € X. Assume that the IFS has contractivity factor s € [0, 1) which is 
independent of 2 € P. Then by Theorem 11.1 in Chapter III, the function A: P > 
H{(X) is continuous. Use this continuity to prove that the Mandelbrot set associated 
with the family of IFS is closed. It is suggested that you begin by showing that the 
set S = {B € H(X) : B is not connected} is an open subset of #((X). 
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2.3. Use Figure VIII.222 to determine some values of A that belong, approximately, 
to the boundary 0M of the Mandelbrot set. Compute images of the correspond- 
ing attractors. Compare images corresponding to two points A, and A2 € dM, with 
|Ai| < |A2|. Explain why the picture of A(A,) is more delicate than the picture of 
A(A2). Also comment on similarities and differences between your images and the 
local geography of the parts of 0M from which they come. 


2.4. The pictures of the Mandelbrot set associated with the family of IFS {C; Az — 


Figure VIII.224. Close- 
up on the spiral penin- 
sula on the edge of 

the Mandelbrot set in 
Figures VIII.221 and 
VIiI.222. What infor- 
mation about the corre- 
sponding fractals does this 
boundary convey? 
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Figure VIII.225. Some 
of the fractals to be found 
at various points near the 
boundary of the Mandel- 
brot set associated with 
the parameterized family 
of IFS {C; Az —1,Az+ 1}. 


1, Az + 1} suggest the conjecture that M is symmetric about the x-axis and about 
the origin. Prove the conjecture. 


2.5. An interesting point in the parameter space for the family {C; Az — 1, Az + 1} 
is A = (1/2, 1/2). This lies on the circle 1/|A| = |A’| = /2 in Figure VIII.228. It 
appears to be located in the interior of the Mandelbrot set, although the IFS is just- 
touching. It corresponds to the Twin-Dragon Fractal. A picture of it is shown in 
Figure VIII.229. It is possible to tile the plane with Twin-Dragons. Various other 
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Figure VIlII.226. Some 
of the IFS attractors to be 
found at various points 
near the boundary of the 
Mandelbrot set associated 
with the parameterized 
family of IFS {C; Az — 
1,Az + 1}. Where would 
you look for an interesting 
fractal? 


values of A also correspond to tilings of the plane. See [Gilbert 1982]. Show that the 
attractor at the point A = (0, 1/ /2) can be used to tile the plane. 


2.6. Notice the line segments on the real axis in Figure VIII.228. In [Barnsley 
1985c] it is proved that 


{7€C:0.5 <A; < 0.53; Az = 0} CM. 
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Figure VIII.227. At- 
tractor of the IFS {C; Az — 
1, Az + 1} corresponding 
to the value of A at the 
tip of the spiral peninsula, 
shown in Figure VIII.224. 


For A in such a line segment what does the attractor look like? Are you suprised, in 
view of what you know about maps of coastlines? 


2.7. Some of the most delicate attractors of the family {C; Az — 1,Az + 1} are 
associated with points on 0M where it touches the circle 1/|A| = |A’| = 2. These 
have the lowest possible fractal dimension while still being connected. Let us call 
these attractors tree-like if w;(A) M w2(A) is a single point. Argue (or, better yet, 
prove) that a tree-like attractor A contains no trapped holes; that is, A contains 
no nontrivial, non—self-intersecting paths that start and finish at the same point. A 
picture of a tree-like attractor is shown in Figure VIII.230. 


2.8. Let e = eje2€3...e,.... be a point in the code space © of two symbols, with 
én € {+1, —1} for all n. Let A € C. Prove that the series 


FOF OF ORF Os" FE + et er ae +> 


has radius of convergence 1. What is the relationship between f(A) and the code 
space map @ : 1 — A(A) associated with the family of IFS {C; Az — 1, Az + 1}? Let 
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DECREASING 
FRACTAL 
DIMENSION 


DECREASING Figure VIII.228. Inside- 
FRACTAL out picture of the 
DIMENSION Mandelbrot set for 


{C; Az — 1,Az + 1}. It 
has been turned inside-out 
| by making the change of 


variables A’ = A~!. The in- 

ner white disk is no-man’s 

land; it does not belong 

to the parameter space. 

| | | The figure also includes 
i ft 4 the two bounds provided 

; i. the circle |A’| = 2 and the 
circle |A’| = /2. 


» TOTALLY 
DISCONNECTED 

HERE AND IN 

BLACK 


|A| < 1. Show that the attractor of the IFS is the union of all points which can be 
written in the form 


ddek deck kd tt dt oh ot pt te te .., 


3 The Mandelbrot Set for Julia Sets 


In this section we introduce a good method for making maps, such as might be found 
in an atlas, of families of dynamical systems. The method is based on the use of 
escape times and is discussed more generally in section 4. Here we restict attention 
to the family 


(C; A(z) = 2A}, 


where the parameter space is P = C. This family is of special importance because 
it provides a model for the onset of chaotic behavior in physical and biological 
systems; see [May 1976] and [Feigenbaum 1979]. Moreover, it was the first family 
of dynamical systems for which a useful computergraphical map was constructed, 
by Mandelbrot. We concentrate on map making. 


310 Chapter Viil 


Figure VIII.229. The 
Twin-Dragon Fractal. You 
can tile the plane with 
these sets. Although it is 
just-touching, it appears 
to lie in the interior of the 
Mandelbrot set. 


Parameter Spaces and Mandelbrot Sets 


The Julia set J(A) associated with f,(z) is symmetric about the origin, O. We 
know this because the filled Julia set, of which J(A) is the boundary, is the set of 
points whose orbits remain bounded. The orbit of z € C remains bounded if and only 
if the orbit of —z remains bounded. 

For some values of 4 € P, O belongs to the filled Julia set, F(A), while for others 
it is quite far from F(A). This suggests that we try to color the parameter space 
according to the distance from O to F(A). How can we estimate this distance? An 
approximate method is to look at the “escape time” of the orbit of O. That is, we 
can color the parameter space according to the number of steps along the orbit of 
O that are required before it lands in a ball around the point at infinity, from where 
we know that all orbits diverge. The intuitive idea is that the longer an orbit of O 
takes to reach the ball, the closer O must be to F(A). Of course, if an orbit does not 
diverge then we know that O € F(A). 

Suppose that we want to make a map corresponding to a region W C P. Here we 
choose 


W= {A = (4, A2) EC? [Ai], [Aa] < 2}. 
Let R > O and define 
V(R) = {z EC: |z| > R} U {oo}. 
Suppose 
R > 0.5 + 0.25 + |A|. 


Then it is readily proved that the orbit { f°"(z)} diverges if and only if it intersects 
V(R). So if we choose R = 10 we are sure that, for all 4 € W, the orbit {f°"(O)} 
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diverges if and only if it intersects V(R). Let us see what happens if we color the 
pixels of W according to the number of iterations required to enter V(10). 

The following program is written in BASIC. It runs without modification on an 
IBM PC with Enhanced Graphics Adaptor and Turbobasic. On any line the words 
preceded by ” are comments: they are not part of the program. 


Program 1. (Algorithm for Coloring Parameter Space According to an Escape Time) 
*Define viewing window, W, and numits. 
numits = 20: a2=-2:b2#-2: c#®2:d22: M = 100 
R =10 ’Define the region V. 
screen 9 : cls ’Initialize graphics. 
for p =1 to M 


for q =1 to M 


’Specify the value of lambda (k,1) in P 
k = at(c-a)*p/M : 1 = b+(d-b)*q/M 


Figure Vill.230. A 
“tree-like” attractor 

A from the family 

(C; wi(z) =Az—1,uw2= 
Az + 1}. The two sets 
w,(A) and w2(A) meet 
approximately at a single 
point. 
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*Specify the initial point, 0, on the orbit 
x =0 : y =0 


*>Compute at most numits points on the orbit of 0 
for n = 1 to numits 


newx = x*x-y*y-k 
newy = 2*x*y-1 
x = newx : y = newy 


7If the most recently computed point lies in V then... 
if x*xty*y > R then 


>... render the pixel (p,q) in color n, and 
’go to the next (p,q). 
pset(p,q),n : n = nunits 


end if 
if instat then end ’Stop computing if any key is pressed! 
next n : next q : next p 


end 


Color Plate 16 shows the result of running a version of Program 1 on a Masscomp 
5600 workstation with Aurora graphics. 

In Figure VIII.231 we show the result of running a version of Program 1, but this 
time in halftones. The central white object corresponds to values of 4 for which the 
computed orbit of O does not reach V during the first numits iterations. It represents 
the Mandelbrot set (defined below) for the dynamical system {C: z? — i}. The bands 
of colors (or white and shades of gray) surrounding the Mandelbrot set correspond 
to different numbers of iterations required before the orbit of O reaches V(10). 
The bands farthest away from the center represent orbits that reach O most rapidly. 
Approximately, the distance from O to F(A) increases with the distance from A to 
the Mandelbrot set. 


Definition 3.1 The Mandelbrot set for the family of dynamical systems {C; z? — 
A} is 
M = {i € P: J(A) is connected}. 


The relationship between escape times of orbits of O and the connectivity of J(A) 
is provided by the following theorem. 
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Figure VIII.231. The 
Mandelbrot set for z? — A, 
computed by escape times. 


Theorem 3.1 The Julia set for the family of dynamical systems {C; fx(z) = 
z*—dA},’ € P=C, is connected if and only if the orbit of the origin does not escape 
to infinity; that is 


M={rAEC:|f°"(0)| A co asn > 00} 


Proof This theorem follows from [Brolin], Theorem 11.2, which says that the 
Julia set of a polynomial, of degree greater than 1, is connected if and only if none 
of the finite critical points lie in the basin of attraction of the point at infinity. f)(z) 
possesses two critical points, 0 and oo. Hence J(A) is connected if and only if 
[f,"(O)| A oo asn —> ©”. 

In this paragraph we discuss the relationship between the Mandelbrot set for 
the family of dynamical systems {C; z* — i} and the corresponding family of IFS 
{C: /z +A, —V/z +A}. We know that for various values of A in C the IFS can 
be modified so that it is hyperbolic, with attractor J(A). For the purposes of this 
paragraph /et us pretend that the IFS is hyperbolic, with attractor J(A), for alld € C. 
Then Definition 2.1 would be equivalent to Definition 3.1. By Theorem 2.1, the 
attractor of the IFS would be connected if and only if w;(J(A)) N w2(J(A)) $ @. But 
w1(C) N w2(C) = O. Then it would follow that the attractor of the IFS is connected if 
and only if 0 € J(A). In other words: we discover the same criteria for connectivity 
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of J(A) if we argue informally using the IFS point of view, as can be proved using 
Julia set theory. This completes the discussion. 


We return to the theme of coastlines and the possible resemblance between fractal 
sets corresponding to points on boundaries in parameter space and the local geome- 
try of the boundaries. Figures VIII.232 and VIII.233 show the Mandelbrot set for 
z* — i, together with pictures of filled Julia sets corresponding to various points 
around the boundary. If one makes a very high-resolution image of the boundary 
of the Mandelbrot set, at a value of A corresponding to one of these Julia sets, one 
“usually” finds structures that resemble the Julia set. It is as though the boundary of 
the Mandelbrot set is made by stitching together microscopic copies of the Julia sets 
that it represents. An example of such a magnification of a piece of the boundary of 
M, and a picture of a corresponding Julia set, are shown in Figures VIII.234 and 
VIMI.235. 

If you look closely at the pictures of the Mandelbrot set M considered in this 
section, you will see that there appear to be some parts of the set that are not 
connected to the main body. Pictures can be misleading. 


Theorem 3.2 [Mandelbrot-Douady-Hubbard] 
The Mandelbrot set for the family of dynamical systems {C; z* — A} is connected. 


Proof This can be found in [Douady 1982]. 


The Mandelbrot set for z? — A is related to the exciting subject of cascades of bi- 
furcations, quantitative universality, chaos, and the work of Feigenbaum. To learn 
more you could consult [Feigenbaum 1979], [Douady 1982], [Barnsley 1984], [De- 
vaney 1986], [Peitgen 1986], and [Scia 1987]. 


Examples & Exercises 


3.1. Rewrite Program 1 in a form suitable for your own computergraphical envi- 
ronment. Run your program and obtain hardcopy of the output. Adjust the window 
parameters a, b, c, and d, to allow you to make zooms on the boundary of the Man- 
delbrot set. 


3.2. Figure VIII.236 shows a picture of the Mandelbrot set for the family of dynam- 
ical systems (C: z” — A} corresponding to the coordinates —0.5 < A, < 1.5, -1.0< 
Az < 1.0. It has been overlaid on a coordinate grid. The middle of the first bubble has 
not been plotted, to clarify the coordinate grid. Let Bp, B,, Bz, B3, ... denote the se- 
quence of bubbles on the real axis, reading from left to right. Verify computationally 
that when A lies in the interior of B, the dynamical system possesses an attractive 
cycle, located in C, of minimal period 2”, for n = 0, 1, 2, and 3. 


3.3. The sequence of bubbles {B,}°°, in exercise 3.2 converges to the Myreberg 
point, A = 1.40115.... The ratios of the widths of successive bubbles converge to 
the Feigenbaum ratio 4.66920 .... Make a conjecture about what sort of “attractive 
cycle” the dynamical system {C; z* — A} might possess at the Myreberg point. Test 
your conjecture numerically. You will find it easiest to restrict attention to real orbits. 
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Figure VIII.232. Man- 
delbrot set for z* — A, dec- 
orated with various Julia 
sets and filled Julia sets. 


Figure Vill.233. Man- 
delbrot set for z? — A, 
decorated with various 
Julia sets and filled Julia 
sets. These often resemble 
the place on the boundary 
from which they come, es- 
pecially if one magnifies 
up enough. 
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Figure VIII.234. A 
zoom on a piece of the 
boundary of the Mandel- 
brot set for z? — A. 


Figure VIII.235. A 
filled Julia set corre- 
sponding to the piece 
of the coastline of the 
Mandelbrot set in Fig- 
ure VIII.234. Notice the 
family resemblances. 
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Figure VIII.236. A 
picture of the Mandel- 
brot set for the family 

of dynamical systems 
{C; z* — A}. It has been 
overlaid on a coordinate 
grid. The middle of the 
first bubble has not been 
plotted, to clarify the coor- 
dinate grid. 


3.4. Make a parameter space map for the family of dynamical systems {C; f,(z)}, 
where /, is the Newton transformation associated with the family of polynomials 


F(2j=2+A-)Dz4+1, Ae P=C. 


Notice that the polynomial has a root located at z = 1, independent of A. Color 
your map according to the “escape time” of the orbit of O to a ball of small radius 
centered at z = 1. Use black to represent values of A for which O does not converge 
to z = 1. Examine some Julia sets of f, corresponding to points on the boundary of 
the black region. Are there resemblances between structures that occur in your map 
of parameter space, and some of the corresponding collection of Julia sets? (The 
correct answer to this question can be found in [Curry 1983].) 


4 How to Make Maps of Families of Fractals Using Escape Times 


We begin by looking at the Mandelbrot set for a certain family of IFS. It is disap- 
pointing, and we do not learn much. We then introduce a related family of dynam- 
ical systems and color the parameter space using escape times. The result is a map 
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Figure VIII.237. The 
complement of the Man- 
delbrot set M, associated 
with the family of IFS 
{C; w) =Az+1,u2= 
A*z — 1}. Points in the 
complement of the Man- 
delbrot set are colored 
black. The boundary of 
M, is smooth and does 
not reveal much infor- 
mation about the family 
of fractals it represents. 
The figure also shows 
attractors of the IFS corre- 
sponding to various points 
on the boundary of M,. 
What a disappointing map 
this is! 


packed with information. We generalize the procedure to provide a method for mak- 
ing maps of other families of dynamical systems. We discover how certain bound- 
aries in the resulting maps can yield information about the appearance of the fractals 
in the family. That is, we begin to learn to read the maps. 

Figures VIII.237 and VIII.238 show the Mandelbrot set M, for the family of 
hyperbolic IFS 


{C; w,(z) =Az + 1, wo(z) =A*z -—1}, P= (AEC: fA] < 1}. 


We use the notation A* = (A, + iA2)* = (A; — iA2) for the complex conjugate of 1. 
The two transformations are similitudes of scaling factor |A|. At fixed A, they rotate 
in opposite directions through the same angle. The figures also show attractors of 
the IFS corresponding to various points around the boundary of the Mandelbrot set. 
What a disappointing map this is! There are no secret bays, jutting peninsulas, nor 
ragged rocks in the coastline. 


Theorem 4.1 (Hardin 1985). The Mandelbrot set My, is connected. Its 
boundary is the union of a countable set of smooth curves and is piecewise dif- 
ferentiable. 


Proof This can be found in [Barnsley 1988d]. 
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SO _ Agu VINL238. The Vill.238. The 
complement of the Man- 
delbrot set M, associated 
with the family of IFS 
{C; w, =Az+1,u.= 
A*z — 1}, together with 
some of the corresponding 
fractals. Notice how these 
have subsets of points 


that lie on straight lines, 
like the local structure of 
aM). 


Let us try to obtain a better map of this family of attractors. In order to do so we 
begin by defining an extension of the associated shift dynamical system, for each 
X. € P \ M;,. Let A(A) denote the attractor of the IFS. One can prove that A(A) is 
symmetric about the y-axis. Hence A(A) € M, if and only if A(A) intersects the 
y-axis. Define f, :C > C 


- fw). — Rez > 0; 
hs wy'(z)  ifRez <0. 
Then, when A is such that A(A) is disconnected, {A(A); f,} is the shift dynamical 
system associated with the IFS; {C; f,} is an extension of the shift dynamical system 
to all of C; and A(A) is the “repelling set” of {C; f,}. This system can be used to 
compute images of A(A) in the just-touching and totally disconnected cases, using 
the Escape Time Algorithm, as discussed in Chapter VII, section 1. 

We make a map of the family of dynamical systems {R?; f,}, 1 € P. To do this we 
use the following algorithm, which was illustrated in Program 8.3.1. The algorithm 
applies to any family of dynamical systems {R*; f,} that possesses a “repelling set” 
A(A), and such that P is a two-dimensional parameter space with a nice classical 
shape, such as a square or a disk. 
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Aigorithm 4.1 Method for Coloring Parameter Space According to 
an Escape Time. 


(i) Choose a positive integer, numits, corresponding to the amount of computa- 
tion one is able to do. Fix a point Q € R* such that Q € A(A) for some, but 
not all,rA € P. 

(ii) Fix.a ball B Cc R? such that A(A) C B for alld € P. Define an escape region 
to be V= R?\ B. 

(iii) Represent the parameter space P by an array of pixels. Carry out the fol- 
lowing step for each i in the array. 

(iv) Compute { f°"(Q) :n = 0, 1, 2,3, ...,mumits}. Color the pixel 4 according 
to the least value of n such that fo" (Q) € V. If the computed piece of the 
orbit does not intersect V, color the pixel black. 


The result of applying this algorithm to the dynamical system defined above, with 
Q = O, is illustrated in Figures VIII.239 and VIII.240 (a)(g) and Color Plates 17 
and 18. 

Figure VIlI.239. A Figure VIII.239 contains four different regions. The first is a neighborhood of O, 
map of the family of dy- surrounded by almost concentric bands of black, gray, and white. The location of 
namical systems {C; f,}, this region is roughly the same as that of P \ My, which corresponds to totally dis- 
where connected and just-touching attractors. The second region is the grainy area, which 

z—1)/A ifRez=0; 

A@ = : + if Re z <0. 


2S SS BBS SHB £ER64RS ESOS rTTYTITILTiivrwustiststtttlitritiitist Ilr iPyrT iT Tre ieee Ebb 
id te ad pt habe bp SESS SESESESSSSSESTSESSSEBPSSCCSSTISSEVEBSSESVATF SSS 
ae gaa s 


The parameter space is 
1,0 <A, < 0.75}. The 
map is obtained by ap- 
plying Algorithm 8.4.1. 
Pixels are shaded accord- 
ing to the “escape time” 
of a point O € R*. The a GC, semen 
exciting places where the 7 Pee Ee es 
interesting fractals are to ee 
be found are not within 
the solid bands of black, 
gray, or white, but within 
the foggy coastline. This 
coastline is itself a fractal 
object, revealing infinite 


complexity under mag- ates HEHE esi cn cs 
nification. In it one finds ROE Un ania Region : 
approximate pictures of ws gS : 
some of the connected ae sect a8 aH ag i 7 


and “almost connected” 
repelling sets of the dy- 
namical system. Why are 
they there? 
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Figure VIII.240. A se- 
quence of zooms on a 
piece of the foggy coast- 
line in Figure VIII.239. 
The window coordinates 
of the highest power 
zoom are 0.4123 <A, < 
0.4139, 0.6208 < Az < 
0.6223. Can you find 
where each picture lies 
within the one that pre- 
cedes it? 


Figure VIII.240. (b) 
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Figure VIII.240. (c) 


Figure VIII.240. (d) 
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Figure VIII.240. (e) 


Figure VIII.240. (f) 
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Figure VIlI.240. (g) 


we refer to as the foggy coastline. Here, upon magnification, one finds complex 
geometrical structures. An example is illustrated in the sequence of zooms in Fig- 
ure VIII.240 (a)-(g). The structures appear to be subtly different from one another. 
Early experiments show that if A is chosen in the vicinity of one of these structures, 
then images of the “repelling set’ of the dynamical system {R?; f,}, computed us- 
ing the Escape Time Algorithm, contains similar structures. An example of such 
an image is shown in Figure VIII.241. The third region, at the lower right in Fig- 
ure VIII.239, is made up of closed contours of black, grey, and white. Here the map 
conveys little information about the family of dynamical systems. To obtain infor- 
mation in this region one should examine the orbits of a point Q, different from O. 
The fourth region, the outer white area in Figure VIII.239, corresponds to dynamical 
systems for which the orbit of O does not escape. It is likely that for A in this region, 
the “repelling set” of the dynamical system possesses an interior. 

Our new maps, such as Figure VIII.239, can provide information about the family 


of IFS 
{C; w,(z) =Az + 1, wo(z) =A*z— 1}, P= {A EC: |A| < 1} 


in the vicinity of the boundary of the Mandelbrot set. For A € 0M, the attractor of 
the IFS is the same as the repelling set of the dynamical system. For A close to dM, 
the attractor of the IFS “looks like” the repelling set of the dynamical system. 
Figure VIII.242 shows a transverse section through the anther of a lily. We include 
it because some of the structures in Figure VIII.240 (a)-(g) are reminiscent of cells. 
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Figure VIlI.241. Im- 
age of the repelling set 
for one of the family 

of the dynamical sys- 
tems whose parameter 
space was mapped in Fig- 
ure VIII.239. This image 
corresponds to a value of 
A that lies within the high- 
est power zoom in Fig- 
ure VIII.240. Notice how 
the objects here resemble 
those in the corresponding 
position in the parameter 
space. 


Figure VIII.242. Lon- 
gitudinal section through 
part of the stigma of a 
lily, showing germinating 
pollen-grains. h pappil- 
lae of stigma; p.g., pollen 
grains; t, pollen tubes. 
Highly magnified. (After 
DodelPort, [Scott 1917].) 
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Algorithm 1 in section 4 can be applied to families of dynamical systems of the 
type described in Theorem 4.1 in Chapter VII. For example, let {R?; f,}, where 
i. € P =m R’, denote a family of dynamical systems. Let X C R? be compact. Let 
f,, :X — R? be continuous and such that f(X) > X. Then f, possesses an invariant 
set A(A) € A(X), given by 


AG =Me5h UR. 


A(A) is the set of points whose orbits do not escape from X. The set of points in P 
corresponding to which orbit of Q does not escape from X is 


M(Q)={AEP: QEA(A)}. 


We conclude this chapter by giving an “explanation” of how family resemblances 
can happen between structures that occur on the boundary of M(Q) and the sets 
A(A). (1) Suppose that A(A) is a set in R? that looks like a map of Great Britain, 
translated by A. Then what does M(Q) look like? It looks like a map of Great 
Britain. (2) Suppose that A(A) is a set that looks like a map of Great Britain at time 
1, translated by 4. We picture the set A(A) varying slowly, perhaps its boundary 
changing continuously in the Hausdorff metric as A varies. Now A(A) looks like a 
deformed map of Great Britain. The local coves and inlets will be accurate represen- 
tations of those coves at about the time A, to which they correspond in the parameter 
space map. That is, the boundary of M(Q) will consist of neighboring bays and 
inlets at different times stitched together. It will be a map that is microscopically ac- 
curate (at some time) and globally inaccurate. (3) Now pretend in addition that the 
coastline of Great Britain is self-similar at each time A,. That is, imagine that lit- 
tle bays look like whole chunks of the coastline, at a given instant. Now what will 
M(Q) look like? At a given microscopic location on the boundary, magnified enor- 
mously, we will see a picture of a whole chunk of the coastline of Great Britain, at 
that instant. (4) Now imagine that for some values of A, Great Britain, in the dis- 
tant future, is totally disconnected, reduced to grains of isolated sand. It is unlikely 
that those values of A belong to M(Q). As A varies in a region of parameter space 
for which A(A) is totally disconnected, it is not probable that Q € A(A). In these 
regions we would expect M(Q) to be totally disconnected. 

The families of sets {A(A) € X : A € P} considered in this chapter broadly fit into 
the description in the preceding paragraph. Both P and X are two-dimensional. The 
sets A(A) are derived from transformations that behave locally like similitudes. For 
each A € P, A(A) is either connected or totally disconnected. Finally, the sets A(A) 
and their boundaries appear to depend continuously on A. 


Examples & Exercises 


4.1. In the above section we applied Algorithm 1 in section 4, with Q = (0, 0), to 
compute a map of the family of dynamical systems 
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ee (z—1)/A if Re z > 0, 
me eat i Re 

The resulting map was shown in Figure VIII.239. This map contains an unexplored 
region. Repeat the computation, but with (a) Q = 0.5, and (b) Q = —0.5, to obtain 
information about the unexplored region. 
4.2. In this example we consider the family of dynamical systems {C; f,}, where 
(z—1)/A if Agx —A1y 29, 
(z+1)/A if Aox — Ary <0. 
The parameter space is A € P = {A €C€:0 <|A| < 1}. This family is related to the 
family of IFS 


f(z) = 


{C; wi(z) =Az + 1, wo(z) = Az — 1}. 


Let A(A) denote the attractor of the IFS and let A(A) denote the “repelling set” 
associated with the dynamical system. Let 


S = {de P: the line A.x — A, y =0 separates w,(A(A)) and w2(A(A))}. 


If A € S then {A(A); fi} is the shift dynamical system associated with the IFS, and 
A(A) = A(A). Even when A ¢ S we expect there to be similarities between A(A) and 
A(A). 


In Figures VIII.243, VIII.244, and VIII.245 and Color Plates 19 and 20 we show 
some results of applying Algorithm 1 in section 4 to the dynamical system {C; /,}. 

In Figure VIII.243, the outer white region represents systems for which the orbit 
of the point O does not diverge, and probably corresponds to “repelling sets” with 
nonempty interiors. The inner region, defined by the patchwork of gray, black, and 
white sections, bounded by line segments, represents systems for which the orbit 
of O diverges and corresponds to totally disconnected “repelling sets.” The grainy 
gray area is the interesting region. This is the “coastline”; it is itself a fractal object, 
revealing infinite complexity under magnification. Figures VIII.244 and VIII.245 
show magnifications at two places on the coastline. The grainy areas revealed by 
magnification resemble pictures of the repelling set of the dynamical system at the 
corresponding values of A. : 


4.3. This exercise refers to the family of dynamical systems {C; z* — A}. Use Algo- 
rithm 1 in section 4 with —0.25 < A, < 2, —1 <A, <1, and Q = (0.5, 0.5) to make 
a picture of the “Mandelbrot Set” M(0.5, 0.5). An example of such a set, for a dif- 
ferent choice of Q, is shown in Figure VIII.246. 
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Figure VIII.243. A 
map of the family of dy- 
namical systems described 
in example 4.2, computed 
using Algorithm 1 in sec- 
tion 4. The parameter 
space is P= {A EC;0< 
A, < 10<A,< 1}. The 
gray grainy area is the in- 
teresting region. This is 
the “coastline”; it is itself 
a fractal object, revealing 
infinite complexity under 
magnification. 


Figure VIII.244. Zoom 
on a small piece of 

the foggy area in Fig- 
ure VIII.242. In it one 
finds grainy areas that re- 
semble the repelling sets 
of the corresponding dy- 
namical systems. At what 
value of A does one find 
them? At the value of A 
in the map where the pic- 
ture you are interested in 
Occurs. 
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Figure VIII.245. Zoom 
on a small piece of 

the foggy area in Fig- 
ure VIII.242. The grainy 
areas in this picture here 
have different shapes from 
those in Figure VIII.244. 


Figure VIII.246. A 
“Mandelbrot set” M (Zp) 
associated with the family 
of dynamical systems 

{C; z* — A}. This was 
computed using escape 
times of orbits of the point 
Z = 2 different from the 
critical point, z = 0. 


Chapter IX 
Measures on Fractals 
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Introduction to Invariant Measures on Fractals 


In this section we give an intuitive introduction to measures. We focus on measures 
that arise from iterated function systems in R?. 

In Chapter III, section 8 we introduced the Random Iteration Algorithm. This 
algorithm is a means for computing the attractor of a hyperbolic IFS in R?. In order 
to run the algorithm one needs a set of probabilities, in addition to the IFS. 


Definition 1.1 An iterated function system with probabilities consists of an IFS 
{X; w1, W2,.--, Wy} 
together with an ordered set of numbers {p, P2, ..., Pw}, Such that 


Pi t+ pot+ p3+-::-+ pn =1and p; > 0 lori = 1.2... 


The probability p; is associated with the transformation w;. The nomenclature 
“IFS with probabilities” is used for “iterated function system with probabilites.” The 
full notation for such an IFS is 


{X; wi, W2,.-., WN; Pi, P2,---» Pn}. 


Explicit reference to the probabilities may be suppressed. 
An example of an IFS with probabilities is 


{C; w1(z), wo(z), w3(z), w4(z); 0.1, 0.2, 0.3, 0.4}, 


where 
w(z) = 0.5z, w2(z) = 0.5z + 0.5, 
w3(z) = 0.5z + (0.5)i, wa(z) = 0.5z + 0.5 + (0.5)i. 
It can be represented by the IFS code in Table [X.1. The attractor is the filled square 
w, with corners at (0, 0), (1, 0), (1, 1), and (0, 1). 
Here is how the Random Iteration Algorithm proceeds in the present case. An 
initial point, z € C, is chosen. One of the transformations is selected ‘“‘at random” 
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Table IX.1. IFS code for a measure on @. 


Se 


be O5-:-0. 0. O5 1 1-;-- 0.1 
2. 00. 0 6 O35 30 1.02 
= 0. U.S ee | eS 
£2205 - 02 0:05 550° 50: 04 


from the set {w,, w2, w3, ws}. The probability that w; is selected is p;, for 1 = 
1, 2, 3,4. The selected transformation is applied to z to produce a new point z; € 
C. Again a transformation is selected, in the same manner, independently of the 
previous choice, and applied to z; to produce a new point z). The process is repeated 
a number of times, resulting in a finite sequence of points {z, :n = 1,2,...,numits}, 
where numits is a positive integer. For simplicity, we assume that zo € m. Then, since 
w;(™) Cm, fori = 1, 2, 3, 4, the “orbit” {z,:n = 1,2, ...,numits} lies in @. 
Consider what happens when we apply the algorithm to the IFS code in Table 
IX.1. If the number of iterations is sufficiently large, a picture of m will be the 
result. That is, every pixel corresponding to m is visited by the “orbit” {z,:n = 


1,2, ...,numits}. The rate at which a picture of m is produced depends on the prob- 
abilities. If numits = 10, 000, then we expect that because the images of m are just- 
touching, 


the number of computed points in w;(@) © 1000, 
the number of computed points in w2(m™) © 1000, 
the number of computed points in w3(@) ~ 1000, 
the number of computed points in w4(@) ~ 1000. 


These estimates are supported by Figure IX.247, which shows the result of running 
a modified version of Program 2 in Chapter III, with the IFS code in Table [X.1, and 
numits = 100, 000. 

In Figure IX.248 we show the result of running a modified version of Program 2 
in Chapter III, for the IFS code in Table IX.1, with various choices for the proba- 
bilities. In each case we have halted the program after a relatively small number of 
iterations, to stop the image from becoming “saturated.” The results are diverse tex- 
tures. In each case the attractor of the IFS is the same set, m. However, the points 
produced by the Random Iteration Algorithm “rain down” on # with different fre- 
quencies at different places. Places where the “rainfall” is highest appear “darker” 
or “more dense” than those places where the “rainfall” is lower. In the end all places 
on the attractor get wet. 

The pictures in Figure IX.248 (a)(c) suggest a wonderful idea. They suggest 
that associated with an IFS with probabilities there is a unique “density” on the 
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Figure 1X.247. The 
Random Iteration Algo- 
rithm, Program | in Chap- 
ter III, is applied to the 
IFS code in Table I[X.1, 
with numits = 100,000. 
Verify that the number of 
points that lie in w;(@) is 
approximately (numits) p;, 
fore = 7, 2 34, 


attractor of the IFS. The Random Iteration Algorithm gives one a glimpse of this 
“density,” but one loses sight of it as the number of iterations is increased. This is 
true, and much more as well! As we will see, the “density” is so beautiful that we 
need a new mathematical concept to describe it. The concept is that of a measure. 
Measures can be used to describe intricate distributions of “mass” on metric spaces. 
They are introduced formally further on in this chapter. The present section provides 
an intuitive understanding of what measures are and of how an interesting class of 
measures arises from IFS’s with probabilities. 
As a second example, consider the IFS with probabilities 


{C; w1(z) = 0.5z + 24 + 24, wo(z) = 0.52 + 24, w3(z) = 0.5z; 0.25, 0.25, 0.5}. 


The attractor is a Sierpinski triangle A. The probability associated with w3 is twice 
that associated with either w; or w2. In Figure IX.249 we show the result of applying 
the Random Iteration Algorithm, with these probabilities, to compute 10,000 points 
belonging to A. There appear to be different “densities” at different places on A. For 
example, w3(A) appears to have more “mass” than either w,;(A) or w7(A). 

In Figure IX.250 we show the result of applying the Random Iteration Algorithm 
to another IFS with probabilities, for three different sets of probabilities. The IFS 
is {R?; w,, w2, w3, w4}, Where w;, is an affine transformation for i = 1, 2, 3, 4. The 
attractor of the IFS is a leaf-like subset of R?. In each case we see a different pattern 
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WR RONG orien oss = - Figure IX.248. The 
eis, Me Sar, A a : * Random Iteration Algo- 
rithm is applied to the IFS 
code in Table [X.1, but 
with various different sets 
of probabilities. The result 
is that points rain down on 
the attractor of the IFS at 
different rates at different 
places. What we are see- 
ing are the faint traces of 
wonderful mathematical 
entities called measures. 
These are the true fractals. 
Their supports, the attrac- 
tors of IFS, are merely sets 
upon which the measures 
live. 


Figure IX.248. (b) 
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Figure IX.248. (c) 


of “mass” on the attractor of the IFS. It appears that each “density” is itself a fractal 
object. 


Examples & Exercises 


1.1. Carry out the following numerical experiment. Apply the Random Iteration 
Algorithm to the IFS code in Table IX.1, for numits = 1000, 2000, 3000, .... In 
each case record the number, \’, of computed points that land in B = {(x, y) € 
IR? : (x — 1)? + (y — 1)? < 1}, and make a table of your results. Verify that the ratio 
N /numits appears to approach a constant. 


1.2. Repeat the computergraphical experiment that produced Figure IX.247. Verify 
that you obtain “similar-looking” output to that shown in Figure [X.247, even though 
you (probably) use a different random-number sequence. 


1.3. The Random Iteration Algorithm is used to compute 100,000 points belonging 
to m, using the IFS code in Table [X.1. How many of these points, do you expect, 
would belong to w; o w3(@)? Why? 


Let (X, d) be a complete metric space. Let {X; w;,..., wy; p,..., Dw} be an 
IFS with probabilities. Let A denote the attractor of the IFS. Then there exists a 
thing called the invariant measure of the IFS, which we denote here by jz. jz assigns 
“mass” to many subsets of X. For example, (A) = 1 and (8) = 0. That is, the 
“mass” of the attractor is one unit, and the “mass” of the empty set is zero. Also 
p(X) = 1, which says that the whole space has the same “mass” as the attractor of 
the IFS; the “mass” is located on the attractor. 
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Not all subsets of X have a “mass” assigned to them. The subsets of X that do 
have a “mass” are called the Borel subsets of X, denoted by B(X). The Borel subsets 
of X include the compact nonempty subsets of X, so that #(X) C B(X). Also, if O is 
an open subset of X, then O € B(X). So there are plenty of sets that have “mass.” Let 
B denote a closed ball in X. Here is how to calculate the “mass” of the ball, w(B). 
Apply the Random Iteration Algorithm to the IFS with probabilities, to produce a 
sequence of points {Z,}P°5. Let 


N(B,n) = number of points in {2o, 21, Z2, 23,---, 2n} 1B, forn =0, 1, 2,.... 


Then, almost always, 
N(B,n) 
ae ly ———— } , 
ue) tiie (n + 1) 


That is, the “mass” of the ball B is the proportion of points, produced by the Random 
Iteration Algorithm, which land in B. (To be precise we also have to require that the 
“mass” of the boundary of B is zero; see Corollary 7.1.) 

By now you should be bursting with questions. How do we know that this formula 
“almost always” gives the same answer? What are Borel sets? Why don’t all sets 
have “mass”? Welcome to measure theory! 

As an example, we evaluate the measure of some subsets of C, for the IFS with 
probabilities 


Figure IX.249. The 
Random Iteration Algo- 
rithm is used to compute 
an image of the Sierpinski 
triangle A. The probabil- 
ity associated with w3 is 
twice that associated with 
Ww, Or wz. One thousand 
points have been com- 
puted. The result is that 
w3(A) appears denser than 
w,(A) or w2(A). This ap- 
pearance is lost when the 
number of iterations is in- 
creased. We are led to the 
idea of a “mass” or mea- 
sure that is supported on 
the fractal. 
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Figure IX.250. The Random Iteration Algorithm is used to compute an image of a leaf. 
Different sets of probabilities lead to different distributions of “mass” on the leaf. 


{C; w,(z) = 0.5z, w2(z) = 0.5z + (0.5)i, w3(z) = 0.5z + 0.5; 0.33, 0.33, 0.34}. 


The attractor is a Sierpinski triangle A with vertices at 0, i, and 1. We compute the 
measures of the following sets: 


B, ={zeEC: |z| < 0.5} 
B, = {ze C: |z — (0.5 + 0.57)| < 0.2} 
Bz = {z EC: |z — (0.5 + 0.57)| < 0.5} 


Bs = {zeEC:|z—-(2+i)| < V2}. 


The results are presented in Table IX.2. 
Figure IX.251 illustrates the ideas introduced here. 


Examples & Exercises 
1.4. Explain why (B,4) © 0 in Table IX.2. 
1.5. What value, approximately, would have been obtained for 4(B;) in Table 


IX.2, if the probabilities on the three maps had been p, = 0.275, p2 = 0.125, and 
p3 = 0.5? 


2 Fields and Sigma-Fields 


Table IX.2. The measures of some subsets of A are computed by random iteration. 


n N(B,,n)/n N(Bz,n)/n N(B3,n)/n = N(Ba,n)/n 
5,000 0.3313 0.1036 0.6385 0.0004 
10,000 0.3314 0.1050 0.6500 0.0002 
15,000 0.3323 0.1041 0.6512 0.0001 
20,000 0.3330 0.1030 0.6525 0.0000 
50,000 0.3326 0.1041 0.6527 0.0000 
100,000 0.3325 0.1054 0.6497 0.0000 


Figure IX.251. Di- 
agram of the Random 
Iteration Algorithm run- 
ning, and a dancing point 
coming and going from 
the ball B. The “mass” 
or measure of the ball is 
L.(B). It is equal to the 
proportion of points that 
land in B. 


The “mass” of B is the 
proportion of time 
spent in B 


a “i : 
Random Iteration Algorithm produces 
a dancing point. 


i DDD DB DD. DB. DB. DB, | 


1.6. Why, do you think, is the phrase “almost always” written in connection with 
the formula for 4(B), given above? 
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2 Fields and Sigma-Fields 


Definition 2.1 Let X be a space. Let F denote a nonempty class of subsets of a 
space X, such that | 


(1) A,BBEFSAUBEF; 
(2) AEF S>X\AEFf. 
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Then F is called a field. 

(In exercise 2.12 you will be asked to prove that X € F.) 

Theorem 2.1 Let X be a space. Let G be a nonempty set of subsets of X. Let F 
be the set of subsets of X which can be built up from finitely many sets in G using the 


operations of union, intersection, and complementation with respect to X. Then F is 
a field. 


Proof Elements of F consist of sets such as 
X \ (((X \ (Gi U G2)) N G3) U (G5 1M Ge)), 


where G,, G2, G3, G3,... denote elements of G. That is, F is made of all those 
sets that can be expressed using a finite chain of parentheses, \, U, 1, elements of 
G, and X. (In fact, using de Morgan’s laws one can prove that it is not necessary to 
use the intersection operation.) If we form the union of any two such expressions 
we obtain another one. Similarly, if we form the complement of such an expression 
with respect to X, we obtain another such expression. So conditions (i) and (ii) in 
Definition 2.1 are satisfied. This completes the proof. 


Definition 2.2 The field referred to in Theorem 2.1 is called the field generated 
by G. | 


Examples & Exercises 
2.1. Let X be a space and let A C X. Then ¥ = {X, A, X \ A, 9} is a field. 


2.2. Let X be the set of all leaves on a certain tree and let F be the set of all subsets 
of X. Then F is a field. Let A denote the set of all the leaves on the lowest branch of 
the tree. Then A € ¥. Prove that F is generated by the leaves. 


2.3. Let X = [0, 1] C R. Let G denote the set of all subintervals (open, closed, half- 
open) of [0, 1]. Let F denote the field generated by G. Examples of members of F 
are [0.5, 0.6) U (0.7, 0.81); [0, 1]; [1, 1]; and (5, 1) U (4, 3) U--- U (s&, gg). Show 


that 
1 1 1 es ae 
© ses — =; 5 ee EUs 
odes 3 i Gi 4 (; :) 
is a subset of X but it is not a member of F. 


2.4. Let X=mcC R?’. Let G denote the set of closed rectangles contained in X, 
whose sides are parallel to the coordinate axes and whose corners have rational 
coordinates. Let F denote the field generated by G. An example of an element of 
F is 

((@@\ (@\ Ri) U Ro) Rs) U (RGN @\ Rs)), 


where Rj, Ro, R3, Ra, and Rs are rectangles in G. Let S € ¥. Prove that the area of 
S is a rational number. Deduce that F does not contain the ball B(O, 1) = {(x, y) € 
mix? + y? <1}. 
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ry 
wV¥v 


A A 


OA BA 
&2 2 iw, & 
Bx b> As A 


2.5. Let X denote the set of pixels corresponding to a certain computer graphics 
display device. The set of all monochrome images that can be produced on this 
device forms a field. Figure [X.252 shows an example of a small field of subsets of 
X. It is generated by the pair of images, G; and Gz, in the middle of the second row, 
together with the set X. X is represented by the black rectangle. The empty set is 
represented by a blank screen. Find formulas for all of the images in Figure [X.252, 
in terms of G,, G2, and X. 


2.6. Let X= denote the code space on two symbols 1 and 2. Letn € {1, 2,3, ...} and 
#611, Zitori= | Z,...,8.ie 


Cle}, 62, anchegee Ze bw; =e; fori=t,2, «..2}. 


Any subset of © that can be written in this form is called a cylinder subset of &. Let 
F denote the field generated by the cylinder subsets of &. Find a subset of & that is 
not in F. 


2.7. Let X be a space. Let F denote the set of all subsets of X. The customary 
notation for this field is F = 2*. Show that F is a field. 


Definition 2.3 Let F be a field such that 
Aj € F fori €1,2,3,.5:.3 04; ef. 


Figure IX.252. A field 
whose elements are sets 
of pixels. Can you find 
two elements of the field 
which generate the field? 
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Then F is called a o-field (sigma-field). 
Given any field, there always is a minimal, or smallest, o-field which contains it. 


Theorem 2.2 Let X be a space and let G be a set of subsets of X. Let {F,:a € 
1} denote the set of all a -fields on X which contain G. Then F = NaF, is a o-field. 


Proof Note that there is at least one o-field that contains G, namely 2%*, the field 
consisting of all subsets of X. We have to show that N.F, is ao-field if each F, is a 
o-field that contains G. Suppose that A; € Ng F,q; then, for each a, A; is an element 
of the o -field F, and so U2, Aj € Fa. Suppose A € No Fq; then, foreach a, A € Fy 
and so X \ A € Fy. Hence X \ A € Mg Fq. This completes the proof. 


Definition 2.4 Let G be a set of subsets of a space X. The minimal o -field which 
contains G, defined in Theorem 2.2, is called the o-field generated by G. 


Definition 2.5 Let (X, d) be a metric space. Let B denote the o -field generated 
by the open subsets of X. B is called the Borel field associated with the metric space. 
An element of B is called a Borel subset of X. 


The following theorem gives the flavor of ways in which the Borel field can be 
generated. 


Theorem 2.3 Let (X, d) be a compact metric space. Then the associated Borel 
field B is generated by a countable set of balls. 


Proof We prove a more general result first. Let G = {b, C X:n = 1, 2,3, 
...;D, open } be a countable base for the open subsets of X. That is, every open 
set in X can be written as a union of sets in G. Then B is generated by G. To see 
this, let B denote the o-field generated by G. Then B c B because G is contained in 
the set of open subsets of X. On the other hand, B Cc B because B contains all the 
generators of B. Hence B = B. 

It remains to construct a countable base for the open subsets of X using balls. For 
R > Olet 


B(x, R)={y € X: d(x, y) < R}. 


Let n be a positive integer. Then X = U;,ex B(x, 4), Hence {B(x, +) : x € X} is an 
open covering of X. Since X is compact it contains a finite subcovering {B(x”, +) : 
m=1,2,..., M(n)} for some integer M(n). We claim that 


1 
D = (B(x, ~)im=1,2,...,M(n);n=1,2,3,...} 


is a countable base for the open subsets of X. Let O be an open subset of X, and let 
x € ©. Then there is an open ball, of radius R > 0, such that B(x, R) c O. Let n be 
large enough that + < 2. Then there is m € {1, 2, ..., M(n)} so that x is in the ball 
B(x"), +), and this ball is contained in O. Each x in O is contained in such a ball, 
belonging to D. Hence D is indeed a countable base for the open subsets of X. This 
completes the proof. 


Examples & Exercises 


2.8. Let B denote the o-field generated by the field in exercise 2.4. Then B contains 
the ball B(O, 1). Similarly it contains all balls in a C R?. Show that B is the Borel 
field associated with ( m, Manhattan). 


2.9. Let © denote the code space on the two symbols {0, 1}. Show that the Borel 
field associated with (, code space metric) is crasitige by the cylinder subsets of 
xX, defined in exercise 2.5. 


2.10. Let A C R? denote a Sierpinski triangle. Let G denote the set of connected 
components of R* \ A. Let F denote the o-field generated by G. Show that F is 
contained in, but not equal to, the Borel field associated with (R?, Euclidean). 


2.11. Let X be a space and let G be a set of subsets of X. Let F; be the field 
generated by G, let F, be the o-field generated by G, and let F3 be the o-field 
generated by F;. Prove that 73 = F>. 


2.12. Let F be a field of subsets of a space X. Prove that X € F. 


3 Measures 


34] 


3 Measures 


A measure is defined on a field. Each member of the field is assigned a nonnegative 
real number, which tells us its “mass.” 

Definition 3.1 A measure pz, on a field F, is a real nonnegative function pw: 
F — [0, 00) C R, such that whenever A; € F fori =1,2,3,..., with A; N Aj = 
fori # j and UX,A; € F, we have 


(UP, Ai) = >) u(A)). 


i=] 
(In other texts a measure as defined here is usually referred to as a finite measure.) 


Definition 3.2 Let (X, d) be a metric space. Let B denote the Borel subsets of 
X. Let u be a measure on B. Then wp is called a Borel measure. 


Some basic properties of measures are summarized below. 

Theorem 3.1 Let F be a field and let 1: F — R be a measure. Then 

(1) If BD A, then p(B) = w(B \ A) + w(B), for A, B € F; 

(2) If B> A, then u(B) = (A); 

(3) p(B) = 

(4) If A; € F fori =1,2,3,...,and U%,A; € F, then u (U%, Ai) < 
> LL(A;); 

(5) If {A; € F} obeys Ay C Az C A3C..., and if U2,Ai € F, then u(Aj) > 
p(U72, Ai). 
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(6) If {A; € F} obeys Ay D Az > A3D..., and if N72, Ai € F, then p(A;i) > 
w(NP2 Ai). 


Proof [Rudin 1966] Theorem 1.19, p. 17. These are fun to prove for yourself! 


We are concerned with measures on compact subsets of metric spaces such as 
(IR?, Euclidean). The natural underlying o-field is the Borel field, generated by the 
open subsets of the metric space. The following theorem allows us to work with the 
restriction of the measure to any field that generates the o-field. 


Theorem 3.2 [Caratheodory] Let denote a measure on a field F. Let F 
denote the o-field generated by F. Then there exists a unique measure jt on F such 
that (A) = f(A) for all A € F. 


Sketch of proof The proof can be found in most books on measure theory; see 
[Eisen 1969] Theorem 5, p. 180, Chapter 6, for example. First jz is used to define an 
“outer measure” 2° on the set of subsets of X. ° is defined by 


ie. ¢) 

u°(A) = inf > H(An) AC UR, An, An € FVn € Z| 
n=1 

° is not usually a measure. However, one can show that the class F° of subsets A 

of X such that—this was Caratheodory’s smart idea— 


w(E) = p(ANE)+p°(X\ ANE) forall Ee 2* 
is a o-field that contains F. One can also show that 2° is a measure on F°. Note that 


PDF. jt is defined by restricting 2° to F. Finally one shows that this extension of 
us. to F is unique. This completes the sketch. 


In the above sketch we have discovered how to evaluate the extended measure ji 
in terms of its values on the original field. 


Theorem 3.3 Leta measure u ona field F be extended to a measure jt on the 
minimal o -field F that contains F . Then, for all A € Ff, 


co 
i(A) = inf |) w(An): A C US An, An € Fn = dro 


n=1 


Examples & Exercises 


3.1. Consider the field F = {X, A, X \ A, 0}, where A #4 X and A #@. A measure 
yu: F > R is defined by w(X) = 7.2, w(A) = 3.5, w(X \ A) = 3.7, and w(@) = 
0. F is also a o-field. The extension of the measure promised by Caratheodory’s 
theorem is just the measure itself. 


3.2. Let F be the field made of sets of leaves on a certain tree, at a certain instant 
in time, and let (A) be the number of aphids on all the leaves in A € ¥. Then y is 
a measure on a finite o -field. 


3.3. Let X =[0,1] C R. Let F be the field generated by the set of subintervals 
of [0, 1]. Let a, b € [0, 1] and define (a, b)) = w([a, b]) = b — a, for a < b; and 
more generally let 


jz(element of F) = sum of lengths of subintervals which comprise the element. 


Show that yz is a measure on F . The o-field F generated by F is the Borel field for 
({0, 1],Euclidean). Show that S = {x € [0, 1]: x is a rational number } belongs to F 
but not to F. Evaluate 2(S), where jz is the extension of yz to F. 


3.4. Let X = X, the code space on the two symbols 1 and 2. Let F denote the field 
generated by the cylinder subsets of X, as defined in exercise 2.5. Let 0 < p; < 1 
and p2 = 1 — p;. Define 

w(C (1, €2,..- 5 €n)) = De; Pe +++ Pens 


for each cylinder subset C (e), €2, ..., @n) of &. Show how yz can be defined on the 
other elements of F in such a way as to provide a measure on ¥. Evaluate 


u({x € & : x7 = 1}) and w(X). 
Extend F to the field F generated by F, and correspondingly extend yu to 1. Show 
that 
S={x€ Lixo = leF 
and evaluate j(S). 
3.5. This example takes place in the metric space ((0, 1], Euclidean). Consider the 
IFS with probabilities 


1 1 a 
{{0, 1]; wi(x) = 3 w2(x) = 37 + 3° Pi» P2}. 


Let F denote the field generated by the set of intervals that can be expressed in the 
form 


We, 0 We, 0... 0 We, ([0, 1)), 
where n € {1,2,...} and e; € {1,2} for each i =1,2,...,n. Let 0 < p; <1 and 
P2 = 1— p;. Show that one can define a measure on ¥ so that, for every such 
interval, 
[L(We, 0 We, 0+ 0 we, ([0, 1})) = Pe, Pez - ++ Den: 
Let A denote the attractor of the IFS. Evaluate (A), 4(X \ A), and u([4, $]). 


3.6. What happens in exercise 3.5 if the IFS is replaced by 
1 1 
{[0, 1]; w(x) = sm, w2(x) = 5% +53 Ph P2}? 
For what value of p; is the extension of the measure to the o-field generated by F 
the same as the Borel measure defined in exercise 3.3? 
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Definition 3.3 Let (X, d) be a metric space, and let 1. be a Borel measure. Then 
the support of wu is the set of points x € X such that u(B(x, €)) > 0 for all € > 0, 
where B(x, €) ={y € X: d(y, x) < €}. 


The support of a measure is the set on which the measure lives. The following is 
an easy exercise. 


Theorem 3.4 Let (X, d) be a metric space, and let u be a Borel measure. Then 
the support of mw is closed. 


Examples & Exercises 


3.7. Let (X, d) be a compact metric space and let yz be a Borel measure on X such 
that 4(X) 4 0. Show that the support of uw belongs to 71(X), the space of nonempty 
compact subsets of X. 


3.8. Prove the following. “Let ~ be a measure on a o-field F, and let F be the 
class of all sets of the form A U B where A € F and B is a subset of a set of measure 
zero. Then F is ao-field and the function 7 : F —> R defined by 7Z(A U B) = 71(A) 
is a measure.” The measure j referred to here is called the completion of uw. The 
completion of the measure in exercise 3.3 is called the Lebesgue measure on [0, 1]. 


4 Integration 


In the next section we will introduce a remarkable compact metric space. It is a space 
whose points are measures! In order to define the metric on this space we need to 
be able to integrate continuous real-valued functions with respect to measures. Can 
one integrate a continuous function defined on a fractal? How does one evaluate the 
“average” temperature of the coastline of Sweden? Here we learn how to integrate 
functions with respect to measures. Let (X, d) be a compact metric space. Let jz be 
a Borel measure on X. Let f : X — R be a continuous function. We will explain the 
meaning of integrals such as 


| f (x) du(x). 
x 


Definition 4.1 We reserve the notation x4 for the characteristic function of a 
set A C X. It is defined by 


ne 1 for xEA, 
ROOT fox SERA 


A function f : X — R is called simple if it can be written in the form 


N 
f(x) = > wixn@), 
i=1 


i= 


where N is a positive integer, I; € B and y, € R fori =1,2,...,N, UN]; =X, 
and 1; 1; =@ fori # j. 


The graphs of several simple functions, associated with different spaces, are 
shown in Figures [X.253 and IX.254. 


Definition 4.2 The integral (with respect to w) of the simple function f in 
Definition 4.1, is 


N 
[ f(x) du(x) = [ fdp= > vin). 
i=] 


Examples & Exercises 


4.1. Let f : [0,1] — R be a piecewise constant function, with finitely many dis- 
continuities. Show that f is a simple function. Let 4. denote the Borel measure on 
[0, 1] such that (J) = length of J, when / is a subinterval of [0, 1]. Show that 


1 
[ f(x) dx = | f (x) du(x), 
0 [0,1] 


where the right-hand side denotes the area under the graph of /. 


4.2. This example takes place in the metric space (@, Euclidean). Let G denote the 
set of rectangular subsets of m. Let F denote the field generated by G. Show that 
there is a unique measure yz on F such that (A) = area of A, for all A € G. Notice 
that the o-field generated by F is precisely the Borel field B associated with (a, 
Euclidean). Let 2 denote the extension of yz to B. Let A denote a Sierpinski triangle 
contained in m. Show that A é€ B, and 


[xa aa = ia) =o. 


4.3. This example concerns the IFS with probabilities 
{C; w1(z), wo(z), w3(Z); pi = 0.2, p2 = 0.3, p3 = 0.5}, 
where 
w)(z) =0.5z, w2(z) = 0.5z + (0.5)i, w3(z) = 0.5z + 0.5. 


Let A denote the attractor of the IFS, and B the Borel subsets of (A, Euclidean). Let 
4 denote the unique measure on B such that 
u(A)=1 
1.(w;(A)) = pj for i € {1, 2, 3}; 
(ww; o w;(A)) = pip; fort; je {f, 2, 3}; 


[(w; 0 Wj +++ 0 We(A)) = Pi Pj... Pk fori, j,.3. ,k € U1, 2,3); 
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Figure IX.253. The 
graph of a simple function 
on a Sierpinski triangle. 
The domain is a Sierpinski 
triangle in the (x, y) 
plane. The function values 
are represented by the z- 
coordinates. 


Figure IX.254. The 
graph of a function whose 
domain is a fractal fern. 
If, instead, the function 
values were represented 
by colors, a painted fern 
would replace the graph. 


Define a simple function on A by 


forx +iy€ Aand1/3<x <1, 


fe 4i= | 
y=) -1 forx +iy € A and0 <x <2/3. 


Calculate f A J (z)du(z), accurate to two decimal places. 


Based on the ideas in section 1 of this chapter, can you guess a method for 
calculating the integral that makes use of the Random Iteration Algorithm? Try it! 


4.4. Show that if a, 8B € R and f, g are simple functions then af + Bg is a simple 
function, and 


af fau+p [ sdu= | (of +s)du. 


4.5. Black ink is printed to make this page. Let m C R* be a model for the page, 
and represent the ink by means of a Borel measure jy, so that (A) is the mass of ink 
associated with the set A Cm. Let A € F denote the smallest Borel set that contains 
all of the letters “a’’on the page. Assume that the total mass of ink on the page is one 
unit. Estimate fj, x4du. 


4.6. Let & denote code space on two symbols {1, 2}. B denotes the Borel field 
associated with (Xx, code space metric). Consider the IFS {2; w,(x) = 1x, w2(x) = 
2x; p; = 0.4, p2 = 0.6}, where “1x” means the string “1x,x2x3...” and “2x” means 
the string “2x)x2x3....’ The attractor of the IFS is &. Let wu denote the unique 
measure on B such that 
[L(w; OW; +++ 0 We(X)) = Pip; .-- Dk fori, j,..x,.& © (1, 2}. 

Define sets A and B in B by 

A={xe€B:x,=O}and B={x €B:x,=1}. 
Define f : & > R by 

f(x) = xa(x) + (2.3) xB (X) for all x € Xd. 


Evaluate the integral 
| f(x) du(x). 
x 


Definition 4.3 Let (X,d) be a compact metric space, and let B denote the 
associated Borel field. Let u be a Borel measure. A partition of X is a finite set of 
nonempty Borel sets, {A; € B:i =1,2,..., M}, such that X =U", Aj, and Aj 
A; =@ for i # j. The diameter of the partition is max{sup{d(x, y):x, y€ Aj} :i= 
L2..., Mi. 

Theorem 4.1] Let (X,d) be a compact metric space. Let B denote the asso- 


ciated Borel field. Let u be a Borel measure on X. Let f :X — R be continuous. 
(i) Let n be a positive integer. Then there exists a partition B, ={Anm€B:m= 


4 
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1,2,..., M(n)} of diameter 1/n. (ii) Let Xn,m € An,m form = 1,2,3,..., and define 
a sequence of simple functions by 

M(n) 

flD= > fGawre, (2) fora~1,2 3... 

m=1 
Then { f,} converges uniformly to f (x). (iti) The sequence { Se fndu} converges. (iv) 
The value of the limit is independent of the particular sequence of partitions, and of 
the choices of Xn.m € An.m- 


Sketch of proof 


(i) Since X is compact it is possible to cover X by a finite set of closed balls 
of diameter 1/n, say bn, bn,2,---,9n,mM(n). We can assume that each ball 
contains a point not in any of the other balls. Then define A, ; = 5,,1, and 
An,j = bnj \ UZ} Ane for j = 2,3, ...,M(n). Then B, = {Anm € B:m = 
1,2,..., M(n)} is a partition of X of diameter 1/n. 

(ii) Let € > 0. f is continuous on a compact space, so it is uniformly continuous. 
It follows that there exists an integer N(€) so that if x, y € X and d(x, y) < 
1/N(e) then | f(x) — f(y)| <€. It follows that | f(x) — f,(x)| < € when 
n> Ne). 

(iii) It is readily proved that { tx fndu} is a Cauchy sequence. Indeed, for all 
n,m > N(e€) we have 


Lf trau— f toa 


It follows that the sequence converges. 

(iv) Let { f,} be a sequence of simple functions, constructed as above. Then there 
is an integer N(e) such that | f(x) — fa(x)| <« when n > N(e). It follows 
that for all n > max{N(e), N(6)}, 


[ tan f fod 


This completes the sketch of the proof. 


= [fn — Sm| dpe < 2ep(X). 
x 


< [ fs — Frldu < 2en(X). 


Definition 4.4 The limit in Theorem 4.1 is called the integral of f (with respect 
to ). It is denoted by 


lim [ fedu =f faw. 
n->oo xX 4 


Examples & Exercises 


4.7. Let (X, d) be a metric space. Let a € X. Define a Borel measure 5, by 5,(B) = 
1 if a € B and 6,(B) = 0 if a ¢ B, for all Borel sets B C X. This measure is referred 
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to as a “a delta function” and “a point mass at a.” Let f : X — R be continuous. 
Show that 


[ Or a eeies 


4.8. This example takes place in the metric space (@, Euclidean). Let yw be the 
measure defined in exercise 4.2, and define f :m—> R by f(x, y) =x? + 2xy 43. 


Evaluate 
f du. 
ae 


4.9. Make an approximate evaluation of the integral f A x*d(x) where w and A 
are as defined in exercise 4.3. 


4.10. Let X denote the set of pixels corresponding to a certain computer graphics 
display device. Define a metric d on X so that (X, d) is a compact metric space. Give 
an example of a Borel subset of X and of a nontrivial Borel measure on X. Show that 
any function f : X — R is continuous. Give a specific example of such a function, 
and evaluate f, fdw. 


5 The Compact Metric Space (P(X), d) 


We introduce the most exciting metric space in the book. It is the space where 
fractals really live. ; 


Definition 5.1 Let (X, d) be a compact metric space. Let 1: be a Borel measure 
on X. If w(X) = 1, then p is said to be normalized. 


Definition 5.2 Let (X,d) be a compact metric space. Let P(X) denote the set 
of normalized Borel measures on X. The Hutchinson metric dy on P(X) is defined 
by 


diy(p, v) = sup | [ fdu -| f dv: f:X— Rf continuous, 
x Xx 


f(x) — f(y)| < d(@, y)Vx, ye x}. 


for all u,v € P(X). 


Theorem 5.1 Let (X,d) be a compact metric space. Let P(X) denote the set 
of normalized Borel measures on X and let dy denote the Hutchinson metric. Then 
(P(X), dy) is a compact metric space. 


Sketch of proof A direct proof, using the tools in this book, is cumbersome. 
It is straightforward to verify that dy is a metric. It is most efficient to use the 
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concept of the “weak topology” on P(X) to prove compactness. One shows that this 
topology is the same as the one induced by the Hutchinson metric, and then applies 
Alaoglu’s theorem. See [Hutchinson 1981] and [Dunford 1966]. 


Examples & Exercises 


5.1. Let K be a positive integer. Let X = {(i, j): i, j =1,2,..., K}. Define a 
metric on X by d((i1, j1), (i2, j2)) = ir — d21 + |i — Jol. Then (X, d) is a compact 
metric space. Let jz € P(X) be such that y((i, j)) = (i + j)/(K? + K’) and let v € 
P(X) be such that v(i, 7) = 1/K?, for all i, j € {1,2,..., N}. Calculate dy(w, v). 


5.2. Let (X, d) be acompact metric space. Let 1 € P(X). Prove that the support of 
wt belongs to 71((X). 


eR LL 


6 A Contraction Mapping on ?(X) 


Let (X, d) denote a compact metric space. Let B denote the Borel subsets of X. Let 
w : X — X be continuous. Then one can prove that w—! : B > B. It follows that if v 
is a normalized Borel measure on X then so is v o w~'. In turn, this implies that the 
function defined next indeed takes P(X) into itself. 


Definition 6.1 Let (X, d) be a compact metric space and let P(X) denote the 
space of normalized Borel measures on X. Let 


{X; wi, W2,..., WN; Pi, P2, +--+» PN} 


be a hyperbolic IFS with probabilities. The Markov operator associated with the IFS 
is the function M : P(X) — P(X) defined by 


M(v) = pivow,! + ppevo ws) +--+ + pwvowy’ 


for all vy € P(X). 


Lemma 6.1 Let M denote the Markov operator associated with a hyperbolic 
IFS, as in Definition 6.1. Let f :X — R be either a simple function or a continuous 
function . Let v € P(X). Then 


N 
[ ramon =Yon. ff owiav. 


i=l 


Proof Suppose that f : X > R is continuous. By Theorem 5.1 we can find a 
sequence of simple functions {f,} which converges uniformly to f. Let n be a_ 
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positive integer. It is readily verified that 


[ namoy = pi [ fidvow;' 
x x 


N 
d, 
N 
=. 
iz] 

N 
> P 
f=zi 


frdvow,' 
w;(X) 


if fno wd. 
x 


The sequence { f fnd(M(v))} converges to f fd(M\(v)). 

For each i € {1,2,..., N} and each positive integer n, f, o w; is a simple func- 
tion. The sequence {f, o w;}"2, converges uniformly to f o w;. It follows that 
{f fr 0 widv}&, converges to f f 0 wj. It follows that {)-”, pi f fro widv}%, 
converges to et Di { f ° widv. This completes the proof. 


Theorem 6.1 Let (X, d) be a compact metric space. Let 


{X; W1, W2,.--, WN; Pi; P2;+ «+, pn} 


be a hyperbolic IFS with probabilities. Let s € (0,1) be a contractivity factor for 
the IFS. Let M : P(X) — P(X) be the associated Markov operator. Then M is a 
contraction mapping, with contractivity factor s, with respect to the Hutchinson 
metric on P(X). That is, 


dy(M(v), M(u)) < sdu(y, pw) for all v, w € P(X). 
In particular, there is a unique measure 1 € P(X) such that 
Miu =p. 


Proof Let L denote the set of continuous functions f : XK — R such that | f (x) — 
f(y)| < d(x, y)Vx, y € X. Then 


dy(M(v), M(u)) = sup | fatm(u) ~ [ fam): f € 2) 


N N 
= supt pifowdu- fy Dif ow; dv: f EL}. 
i=l iz] 


Let f=s >, pif ow;. Then fel. LetL={fel:f=s ">, pf ow, 
some f € L}. Then we can write 


dy(M(v), M(u)) = sup{s | fdu-s | fdv: f eL}. 


Since L C L, it follows that 
dy(M(v), M(u)) < sdy(y, p). 


This completes the proof. 
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Definition 6.2 Let u denote the fixed point of the Markov operator, promised by 
Theorem 6.1. is called the invariant measure of the IFS with probabilities. 


We have arrived at our goal! This invariant measure is the object we discussed 
informally in section 1 of this chapter. Now we know what fractals are. 


Examples & Exercises 


6.1. Verify that the Markov operator associated with a hyperbolic IFS on a compact 
metric space indeed maps the space into itself. 


6.2. This example uses the notation in the proof of Theorem 6.1. Let f € L and let 
f =s > N, pif 0 w;. Prove that f € L. 


6.3. Consider the hyperbolic IFS 
{mC R*; wi, wo, W3, Wa; Pi, Pr, P3, Da} 


corresponding to the collage in Figure IX.255(a). Let M be the associated Markov 
operator. Let wo € P(X), so that wo(@) = 1. For example, o could be the uni- 
form measure, for which jo(S) is the area of S € P(@). We look at the sequence 
of measures {z, = M°"(9)}. The measure 4; = M(j10) is such that 4(w;(@)) = p; 
for i = 1,2, 3,4, as illustrated in Figure IX.255(b). It follows that u2 = M °2 (110) 
obeys (w; o w;(@)) = pip; fori, j = 1, 2, 3,4, as illustrated in Figure [X.255(c). 
We quickly get the idea. When the Markov operator is applied, the “mass” in 
a cell mj, = Wj 0 Wj O--- 0 w,(®) is redistributed among the four smaller cells 
Ww (@ij...k), W2(Mij...k), W3(Bij...k), and w4(m@;..4). Also, mass from other cells is 
mapped into subcells of m;_; in such a way that the total mass of m;;..% Temains 
the same as before the Markov operator was applied. In this manner the distribution 
of “‘mass” is defined on finer and finer scales as the Markov operator is repeatedly 
applied. What a wonderful idea. We have also illustrated this idea in Figures [X.256 
and [X.257. 


6.4. Apply the Random Iteration Algorithm to an IFS of the form considered in 
example 6.3. Choose the probabilities pi, p2, p3, and p, so as to obtain a “picture” 
of the invariant measure that would occur at the end of the sequence that commences 
in Figure IX.257(a), (b), (c), and (d). 


6.5. Consider the IFS 
{{0, 1] C R; wy (x) = (0.5)x, w(x) = (0.7)x +. 0.3; p; = 0.45, pz = 0.55}. 


The attractor of the IFS is [0,1]. Let M denote the associated Markov operator. 
Let uo € P([0, 1]) be the uniform measure on [0, 1]. In Figure 1X.258(a), jo is 
represented by a rectangle, whose base is [0, 1] and whose area is 1. The successive 
iterates M(,19), M°?(49), M°>(z9) are represented in Figure [X.258(b), (c) and (d). 
Each measure is represented by a collection of rectangles whose bases are contained 
in the interval [0, 1]. The area of a rectangle equals the measure of the base of the 
rectangle. Although the sequence of measures converges {M°"(4o)} in the metric 


(a) 


: = es 
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Figure 1X.255. A col- 
lage for an IFS of four 
maps. The attractor of the 
IFS is @, and the prob- 
ability of the map w; 

is p; for i = 1, 2, 3, 4. 
Let M denote the as- 
sociated Markov oper- 
ator. Let 49 = 1. Then 
[41 = M({o) is a measure 
such that (w;(@)) = p; 
for i = 1, 2, 3, 4, as illus- 
trated in (b). The measure 
[42 = M°*(19) is such that 
H.(w; o w;(™)) = p;p; for 
i, j = 1,2, 3,4, as illus- 
trated in (c). See also Fig- 
ures IX.256 and IX.257. 


Figure IX.255. (b) 
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Figure IX.255. (c) 


space{P([(0, 1], dy}, some of the rectangles would become infinitely tall as n tends 
to infinity. 
6.6. Make a sequence of figures, analagous to Figure [IX.258(a)—(d), to represent 
the Markov operator applied to the uniform measure 40, for each of the following 
IFS’s with probabilities: 
(i){[O, 1] C R; wy (x) = (0.5)x, w(x) = (0.5)x + 0.5; p; = 0.5, pz = 0.5}; 
(ii){[O, 1] C R; wy (x) = (0.5)x, wo(x) = (0.5)x + 0.5; 
pi = 0.99, p> = 0.01}; 
(iti) {[0O, 1] C R; wy (x) = (0.9)x, w2(x) = (0.9)x + 0.1; py = 0.45, pp = 0.55}. 


In each case describe the associated invariant measure. 


6.7. Let X = {A, B, C} denote a space that consists of three points. Let B denote 
the o-field that consists of all subsets of X. Consider the IFS with probabilities 


{X; w1, w2; p; = 0.6, po = 0.4}, 


where w, : X —> X is defined by w,(A) = B, w;(B) = B, w,(C) = B, and w 2: 
X — X is defined by w2(A) = C, w2(B) = A, and w2(C) = C. Let P(X) denote the 
set of normalized measures on B. Let wo € P(X) be defined by wo(A) = uo(B) = 
juo(C) = 5. Let M denote the Markov operator associated with the IFS, and let 
Ln = M°" (uo) for n = 1,2, 3,.... Determine real numbers a, b, c, d, e, f, g, h, 
i such that for each n, 


[n(A) op € [n-1(A) 
Ur(B)|=|d e f Un-1(B) 
Un(C) yn 4 fn -JtO) 
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Figure IX.256. This il- 
lustrates the action of the 


Markov operator on one 
of the sequence of mea- 
sures {M°"(t1o9)}, where 


[Lo(@) = 1. When the 
Markov operator is ap- 
MASS IN CELL {S plied, the “mass” in a cell 
DISTRIBUTED TO | w, ,=w,0w,0---0 
SMALLER CELLS w;,(@) is redistributed 
among the four cells 
Ww (ij..k), W2(i;...4), 
w3(@;..%), and w4(@;__ x). 
Also, mass from other 
— cells is mapped into sub- 
cells of @;_, in such a 
way that the total mass of 
@,;__x Temains the same as 
before the Markov oper- 
ator was applied. In this 
manner the distribution 
of “mass” is defined on 
finer and finer scales as 
the Markov operator is 
repeatedly applied. 


MASS FROM 
ELSEWHERE 
MAKES UP 
THE LOSS 
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Figure IX.257. This 
sequence of figures repre- 
sents successive measures 
produced by iterative ap- 
plications of a Markov 
operator of the type con- 
sidered in Figures IX.255 
and IX.256. The result 
of one application of the 
operator to the uniform 
measure on M@ is repre- 
sented in (a). Figures (b), 
(c), and (d) show the re- 
sults of further successive 
applications of the Markov 
operator. The measures are 
represented in such a way 
as to keep the total number 
of dots constant. The mea- 
sure of a set corresponds 
approximately to the num- 
ber of dots it contains. 
This represents the first 
few of a sequence of mea- 
sures that converges in the 
metric space (P(@), dy) 
to the invariant measure of 
the IFS. 


Figure IX.257. (b) 
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Figure 1X.257. (c) 


Figure IX.257. (d) 
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a 


Figure IX.258. This | 
sequence of images re- 
lates to the IFS {[0, 1] Cc 

R; wi (x) = (0.5)x, w2(x) 

= (0.7)x + 0.3, p) = 

0.45, p2 = 0.55}. The at- 
tractor of the IFS is (0, 1]. 
Let M denote the associ- 
ated Markov operator. Let 
[to € P([O, 1]) be the uni- 
form measure on [0, 1]. 

The successive iterates 

M (to), M°*(0), (a) 


M°(po), and M™(j40) 
are represented in parts 
(a),(b),(c), and (d). Each 
measure is represented by 
a collection of rectangles 


whose bases are contained 

in the interval (0, 1]. The 

area of a rectangle equals 

the measure of the base of 

the rectangle. 
(b) ; 
(c) ii. 1 


ose 
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Figure IX.258. (d) 


Let M denote the 3 x 3 matrix here. Explain how M is related to M, and show that 
the invariant measure of the IFS can be described in terms of an eigenvector of M. 


6.8. Let 


{X; wi, W2,..-, WN; Pi, P2,---» Pn} 


be a hyperbolic IFS with probabilities. Let 4. denote the associated invariant mea- 
sure. Let A denote the attractor of the IFS. Let wo € P(X) be such that jo(A) = 1. 
By considering the sequence of measures {u, = M°"()}, prove that 


[(w; ow; o-++0 W(A)) = DiPj --- Pk: forall, J, =... € Uy 2... <0. 
Show that if the IFS is totally disconnected then the equality sign holds. 
Theorem 6.2 Let (X, d) be a compact metric space. Let 


{X; wi, W2,..., WN; Pi, P2,--+s PN} 


be a hyperbolic IFS with probabilities. Let 4 be the associated invariant measure. 
Then the support of is the attractor of the IFS {X; w,, w2,..., Wn}. 


Proof Let B denote the support of 1. Then B is a nonempty compact subset of 
X. Let A denote the attractor of the IFS. Then 


{A; W1, W2,--.-, WN; Pi; P2,-++, PN} 


is a hyperbolic IFS. Let v denote the invariant measure of the latter. Then v is also an 
invariant measure for the original IFS. So, since yw is unique, v = . It follows that 
B.C A. 

Let a € A. Let © be an open set that contains a. We will use the notation of 
Theorem 2.1 in Chapter IV. Let & denote the code space associated with the IFS 
and let o € > denote the address of a. It follows from Theorem 2.1 in Chapter IV 
that lim,,,.. @(o, n, A) =a, where the convergence is in the Hausdorff metric. It 
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follows that there is a positive integer n so that (0, n, A) C O. But 


U(o(o,n, A)) = Po, Po, --- Po, > 9. 


It follows that ~(O) > 0. It follows that a is in the support of yz. It follows that 
a € B. It follows that A C B. This completes the proof. 


Theorem 6.3 The Collage Theorem for Measures... Let 


{X; Wi, W2,..-, WN; Pi, P2,--+, Pn} 


be a hyperbolic IFS with probabilities. Let be the associated invariant measure. 
Let s € (0, 1) be a contractivity factor for the IFS. Let M : P(X) > P(X) be the 


associated Markov operator. Let v € P(X). Then 
dy(v,M 
d HQ, pL) < du(v, M(v)) 
i =s) 


Proof This is a corollary of Theorem 6.1. 


We conclude this section with a description of the application of Theorem 6.3 to 
an inverse problem. The problem is to find an IFS with probabilities whose invariant 
measure, when represented by a set of dots, looks like a given texture. 

A measure supported on a subset of R? such as @ can be represented by a lot of 
black dots on a piece of white paper. Figures [X.248 and IX.250 provide examples. 
The dots may be granules of carbon attached to the paper by means of a laser printer. 
The number of dots inside any circle of radius 5 inch, say, should be approximately 
proportional to the measure of the corresponding ball in R?. A gray-tone image in a 
newspaper is made of small dots and can be thought of as representing a measure. 

Let two such images, each consisting of the same number of points, be given. 
Then we expect that the degree to which they look alike corresponds to the Hutchin- 
son distance between the corresponding measures. Let such an image, L, be given. 
We imagine that it corresponds to a measure v. Theorem 6.3 can be used to help to 
find a hyperbolic IFS with probabilities whose invariant measure, represented with 
dots, approximates the given image. Let N be a positive integer. Let w; : R? > R? 
be an affine transformation, for i = 1, 2,..., N. Let 


Z; ° 
{R°; w1, W2,..., WN; P1, P2,---, Py} 


denote the sought-after IFS. Let M denote the associated Markov operator. 

Let p;&L mean the set of dots L after the “density of dots” has been decreased 
by a factor p;. For example 0.5&L means L after “every second dot” in L has been 
removed. The action of the Markov operator on v is represented by U_, w;(p;&L). 
This set consists of approximately the same number of dots as L. Then we seek 
contractive affine transformations and probabilities such that 


UN wi(pi&L) © L. (1) 
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That is, the coefficients that define the affine transformations and the probabilities 
must be adjusted so that the left-hand side “looks like” the original image. 

Suppose we have found an IFS with probabilities so that equation 1 is true. Then 
generate an image L of the invariant measure of the IFS, containing the same number 
of points as L. We expect that 


LL. (2) 


If the maps are sufficiently contractive, then the meaning of “~” should be the same 
in both equations 1 and 2. These ideas are illustrated in Figure [X.259. 


Examples & Exercises 


6.9. Use the Collage Theorem for Measures to help find an IFS with probabilities 
for each of the images in Figures IX.260, [X.261, and IX.262. 


Figure IX.259. This 
illustration relates to the 
Collage Theorem for 
Measures. The shades 
of gray “add up” in the 
overlapping regions. 
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Figure IX.260. Can 
you find the IFS and prob- 
abilities corresponding to 
this texture? 


Figure 1X.261. Deter- 
mine the IFS and probabil- 
ities for this cloud texture. 
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Figure 1X.262. Find 
the four affine maps and 
probabilities for this tex- 
ture. 


6.10. Estimate the probabilities and transformations used to make each part of 
Figure IX.248. 


6.11. Let 


{X; W1, W2,.--, WN; Dye Pec. ss Put 


be a hyperbolic IFS. Let 4 denote the invariant measure. Let A denote the attractor. 
Let © denote the associated code space on the N symbols {1,2,..., N}. Let 7; : 
x — LD be defined by 7;(0) = io, for all o € &, fori = 1, 2, 3, 4. Let p denote the 
invariant measure for the hyperbolic IFS 


{2; I}, Tp, T3, 14; Pi; P2, D3; Pa}. 


Let ¢ : & — A denote the continuous map between code space and the attractor of 
the IFS intoduced in Theorem 4.2.1. Prove that o(¢~'(B)) = w(B) for all Borel 
subsets B of X. 


6.12. Figure IX.263 depicts the invariant measure for the IFS {[0, 1] C R; w(x) = 
a\X, W2(x) = ax + 23 Pi, 

P2}, where a;, a2, and eé2 are real constants such that the attractor is contained in 
[O, 1]. The measure of a Borel subset of [0, 1] is approximately the amount of black 
that lies “vertically” above it. Find a), a2, and ep. 
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Figure IX.263. This 
figure depicts the invari- 
ant measure for the IFS 
{[0, 1] CR; wi(x) = 
Q\X, W2 = Ax + e2; 

Pi, P2}, where aj, a2, and 
€2 are real constants such 
that the attractor is con- 
tained in [0, 1]. The mea- 
sure of a Borel subset of 
[0, 1] is approximately the 
amount of black that lies 
“vertically” above it. Can 
you find a, a2, and e2? 


7 Elton’s Theorem 


Both the following theorem and its corollary claim that certain events occur “with 
probability one.” Although this has a very precise technical meaning, it is fine to in- 
terpret it in the same way as you would interpret the statement “There is a 100% 
chance of rain tomorrow.” After the statements we mention the mathematical frame- 
work used for dealing with probabilistic statements. To go further we recommend 
reading parts of [Eisen 1969]. 

The theorem below is actually true when the p;’s are functions of x, the w;’s are 
only contraction mappings “on the average,” and the space is “locally” compact. 


Theorem 7.1 Let (X, d) be a compact metric space. Let 
{X; wi, W2,..., WN; Pi, P2,---, Pn} 


be a hyperbolic IFS with probabilities. Let (X,d) be a compact metric space. Let 
{Xn}i=9 denote an orbit of the IFS produced by the Random Iteration Algorithm, 
Starting at xo. That is, 


Xn = Wo, 0 Wo,_; 9°'* O Wo, (Xo), 
where the maps are chosen independently according to the probabilities 


Pi; P2;-++;DN> for n= 4, 2,3, 2. 
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Let be the unique invariant measure for the IFS. Then with probability one (that 
is, for all code sequences 0;, 02, ... except for a set of sequences having probability 
zero), 


128 
diy oe 2 fw) = [ f (x) du(x) 


for all continuous functions f :X — R and all xo. 
Proof See [Elton 1987]. 
Corollary 7.1 Let B be a Borel subset of X and let u(boundary of B) = 0. Let 
N(B,n) = number of points in {xo, x1, x2, X3,.-.,%n} OB, forn =0, 1,2,.... 


Then, with probability one, 


el 
(n+ 1) 


for all starting points xo. That is, the “mass” of B is the proportion of iteration steps, 
when running the Random Iteration Algorithm, which produce points in B. 


(B) = lim | 


n-—> 00 


Let’s explain more deeply the context of the statement “with probability one.” Let 
> denote the code space on the N symbols {1,2,..., N}. Let p denote the unique 
Borel measure on & such that 


p(C (04, 02, .--,Om)) = Po, Por +++ Pom 
for each positive integer m and all 01, 02,..., Om € {1,2,..., N}, where 
Clay, 0, . 3, On) = {0 € Bt wy = 0], Go = Oo, ..., On = Gm). 


Then p € P(X). This measure provides a means for assigning probabilities to sets 
of possible outcomes of applying the Random Iteration Algorithm. Let us see how 
this works. 

When the Random Iteration Algorithm is applied, an infinite sequence of symbols 
@}, @2, @3,..., namely a code w = @1@203... € LZ, is generated. Provided that we 
keep xo € X fixed, we can describe the probabilities of orbits {x,} in terms of the 
probabilities of codes w. So we examine how probabilities are associated to sets of 
codes. 

The Random Iteration Algorithm is applied and produces a code w € &. What is 
the probability that #; = 1? Clearly it is p; = p(C(1)). What is the probability that 
@ = 01, 2 = 02, ..., and w, = 0,? Because the symbols are chosen independently, 
it 1s 

PCG, Or, <<, Om)) = Po, Be, -+- Pa. 


Let B denote a Borel subset of &. What is the probability that the Random Itera- 
tion Algorithm produces a code o € B? It is at least intuitively reasonable that it is 
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p(B). This can be formalized, see, for example [Eisen 1969]. The measure p pro- 
vides a means of describing the probabilities of outcomes of the Random Iteration 
Algorithm. 

Here is a heavy way of stating the central part of Theorem 7.1. “... Let xo € X. 
Let B C 2 denote the set of codes o € & such that 


n 


; 1 
noon + |] >, fx) =| #@ duc, 


k=0 


for all xo € X and all continuous functions f : X — R, where 
XW OW, 9 .., 6 We iXo). 


Then B is a Borel subset of & and p(B) = 1.” A similar heavy restatement of the 
corollary can be made. 


Examples & Exercises 


7.1. This example concerns the IFS 


bo. st 1 
, 1); =x, = ~;, VU.J, U.I5. 
{{0, 1] ge eas 0.5, 0.5} 


Show that the invariant measure yz is such that w([x, x + 4]) = A when [x, x + 4] is 
a subinterval of [0, 1]. Deduce that if f : [0, t] — R is a continuous function then 


1 
[ feac= f Ee 
0 [0,1] 


Let f(x) = 1+ x”. Compute approximations to the latter integral with the aid of 
Elton’s theorem and the Random Iteration Algorithm. Compare your results with the 
exact value ;. 


7.2. This example concerns the IFS 
{mC R*: wy, w2, w3, wa; 0.25, 0.25, 0.25, 0.25} 


corresponding to the collage in Figure [X.255(a). Let 4 denote the invariant mea- 
sure. Argue that yz is the uniform measure that assigns “measure” dxdy to an in- 
finitesimal rectangular cell of side lengths dx and dy. Use Elton’s theorem and the 
Random Iteration Algorithm to evaluate approximations to 


[oe + 2xy + 3y’) dxdy. 
ip 


Compare your approximations with the exact value. 


7.3. This example concerns the IFS 


}, 


WW] — 


i 1 
A R?: ’ 9 <<. 5 
{A CR’; wy, wo, w3 33 


where 
1 


a | 1 4 1 1 1 
w(x, y) = (52 5»): w(x, y) = (5% ay + 5)» w3(x, y) = (5% = 5 5»): 
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and A is the attractor of the IFS, our old friend. Let 4 denote the invariant measure 
of the IFS. Argue that jz provides a good concept of a “uniform” measure on A. Use 
Elton’s theorem and the Random Iteration Algorithm to compute approximations to 


[.@ + 2xy + 3y*) dxdy. 


In Chapters II, III, and IV, we introduced the space LX of shifts on N symbols. 
It was mentioned in passing in Chapter IV that any invertible mixing function could 
be represented by a baker’s transformation with “uneven cutting and stretching.” We 
are now in a position to show how this comes about using an example-involving two 
simple IFS. The same model with some necessary refinements yields the code space 
mixing model used to justify the representations. It is easier to visualize without the 
refinements, as we present it here. It is one of the most important properties involved 
with the modelling of physical chaos. 

We begin with perhaps the most simple of all IFS with probabilities. On the 
interval [0, 1], we define the just-touching IFS with N maps and with probabilities 


{[0, 1); Wi, W2,.--, WN; Pi, P2,---; Pn}; 


where 
l 
w(x) = 3 
io rt 
Ma) aT N 
1 3 
w3(x) = Hx + 
@ane tea} 
wn (x) = ax NO? 


and the probabilities are arbitrary, subject to the usual condition 


N 
> het 


i=l 


Associated with this IFS there is an invariant measure on [0, 1], which we denote 
by v. 

Now we define another IFS on [0, 1], this time without probabilities, using the p; 
from above. On [0, 1] define the IFS 


{[0, 1); V4, U2,..--, vy}, 


where 
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v(x) = pix 
V2(x) = pox + py 
v3(x) = p3x + (pi + p2) 


k—-1 
ve(x) = pex + — pi 


i] 


N-1 
UnN(X) = pyx + ye: Di- 
i=] 

This IFS is also just-touching by construction and, because the probabilities from the 
first IFS sum to one, has as its attractor the interval [0, 1] as well. We are going to 
use it to define an equivalent metric on [0, 1] as follows: 

Each point has a unique address under this IFS in code space, except the points 
v;(A), whose multiple addresses correspond to 


o =iN-—1=(i+1)0. 


These are precisely the points in a base N expansion of a real number which 
are equated to form the real line. We denote the value of a point x with address 
X\X2x3... in this new metric space to be the real number with N-ary expansion 
.X1X2X3.... In effect we have given each point the numeric value that would corre- 
spond to having measured its distance from say 0 with a ruler on which the spacing 
of the tick marks had been made uneven in a very specific way by the IFS. 

With these values, the space is still [0, 1], but we put a metric on it by assigning 
the distance between two real numbers to be the distance measured with a “normal” 
ruler. Another way to put it is that we take the normal interval [0, 1] and assign the 
distance between two points to be the distance between the addresses corresponding 
to their N-ary expansions in the above IFS. Thus if N = 10 for instance, the distance 
between .251 and .137 is not .251 — .137, but rather the distance between the points 
with addresses 2510 and 1370 in the IFS {[0, 1]; vi, v2,..., vj0}. We will call this 
space [0, 1],, and the distance function d, to avoid confusion. 

We have a metric space, so we will now assign a Borel measure to it by defining 
L([a, b]) = w((a, b)) = d,(a, b), which is uniform for this metric space. And to 
proceed with the example, we need a function, f : [0, 1] — [0, 1], which we define 
by f(x) =(point with value x in [0, 1],). Because the definition was very careful to 
preserve the ordering of the real line and its conventions about multiple addressing, 
f is both a homeomorphism and a metric equivalence. Because it is continuous, it 
is also what is called a measurable function in that if A € B((0, 1],) then f~'(A) € 
B((0, 1]). 


7.4. Show that f is measure-preserving with respect to the invariant measure on v 
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associated with 


{(O, 1]; wi, w2,..., WN; Pi, P2,---» Pn}; 
that is, that for any Borel subset A € [0, 1], we have v(A) = w(f(A)). 


We now have the machinery to cast the Random Iteration Algorithm entirely in 
terms of IFS with no recourse to randomness. It is really a deterministic model, with 
the random part coming in to help when a very simple statement made all the time 
in mathematics turns out to be something a computer cannot do. 

The exact transfer of the Random Iteration Algorithm to the model using the 
space ((0, 1],, dp) looks like this: define the function g : [0, 1] — [0, 1], by 


dp(g(x),0) =x. 


Define the map h : [0, 1], — {0,1,2,..., N — 1} defined by h(p) = [Np] where 
[-] is the greatest integer function. In other words, take the first N-ary digit of the 
value of the point p € [0, 1],. Define the map y : [0, 1] — [0, 1] given by y(x) = 
Nx mod N. Then the Random Iteration Algorithm is precisely the iteration of the 
map R : [0, 1] x X given by 


R(p, x) = (y(p), Whog(p)(X))- 


Where does the random part of the algorithm come in? We need it to “pick a real 
number.” One can think of the random number at each iteration as a function to get 
the next digit of the real number we “picked.” In the above expression, we get a 
random number and find out which function to use via h(g(p)). Then we iterate the 
IFS using wWhog(p)(x), and in order to have a new “random number” the next time, we 
advance p to the next digit using y(p). 

Now, consider the space [0, 1], x [0, 1]. Think of it as a square with coordinates 
spaced unevenly in the x direction and evenly in the y direction. Your “usual” point 
in the square (where here usual means with probability one) has an N-ary expansion 
for y in which every digit occurs with equal probability, while the x value has an N- 
ary expansion in which 0 occurs with probability p,, 1 occurs with probability po, 
etc. 


7.5. Draw a diagonal from (0, 0) to (1, 1) on this square. Show that this statement 
is still true if we pick a “usual point” from this diagonal. 


7.6. Draw asmooth curve from (0, 0) to (1, 1) on the square. Then the statement is 
still true if we pick a “usual point” from this curve. 


By using the diagonal in exercise 7.5, we can take a point x in [0, 1] and map it to 
a new point x, by putting x along the vertical coordinate and reading the horizontal 
coordinate like a web diagram. In terms of all the functions we have defined, this 
operation is x = f ~!(g(x)). Under the original IFS with probabilities, this new point 
will, with probability 1, have an orbit under the shift dynamical system {A; S} with a 
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distribution of dots identical to the one we would get by using the Random Iteration 
Algorithm with probabilities {p;, p2,..., pw}. 

There seems to be a lot of mileage in this square with the strange coordinates. 
There should be; the uneven coordinates correspond to future cuts and stretches for 
the baker’s transformation with uneven stretches and cuts. (A real baker’s transfor- 
mation would not use the just-touching IFS used here, but it’s easier to visualize, 
and for the most general case N is allowed to be infinite.) It is a mixing function, so 
it automatically satisfies the equation that results from Elton’s theorem (a property 
called ergodicity). The theorem takes care of how little “hyperbolicity” an IFS can 
have and still retain this property. Alternatively, Elton’s theorem can be viewed as a 
set of minimal requirements on the w; such that the baker’s transformation as set up 
here accurately reflects the behavior of the IFS-on addresses. 


8 Application to Computer Graphics 


We begin by illustrating how a color image of the invariant measure of an IFS with 
probabilities can be produced. The idea is very simple. We start from an IFS such as 


{C; 0.5z + 24 + 247, 0.5z + 24%, 0.5z; 0.25, 0.25, 0.5}. 


A viewing window and a corresponding array of pixels P;; is specified. The Random 
Iteration Algorithm is applied to the IFS, to produce an orbit {z,:n =0,1,..., 
numits}, where numits is the number of iterations. For each (i, 7) the number of 
points, N(P;;), which lie in the pixel P;; are counted. The pixel P;; is assigned the 
value NV (P;;)/numits. By Elton’s theorem, if numits is large, this value should be a 
good approximation to the measure of the pixel. The pixels are plotted on the screen 
in colors determined from their measures. 

The following program implements this procedure. It is is written in BASIC. 
It runs without modification on an IBM PC with Enhanced Graphics Adaptor and 
Turbobasic. 


Program 1. (Uses the Random Iteration Algorithm to Make a “Picture” of the Invariant 
Measure Associated with an IFS with Probabilities) 
screen 9: cls ’Initialize graphics. 


dim s(51,51) ’Allocate array of pixels. 


7IFS code for a Sierpinski triangle. 
a(1)=0.5 : b(1)=0 : c(i)=0 : d(1)=0.5 : e(1)=24 : £(1)=24 


a(2)=0.5 : b(2)=0 : c(2)=0 : d(2)=0.5 : e(2)=0 : £(2)=24 
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a(3)=0.5 : b(3)=0 : c(3)=0 : d(3)=0.5 : e(3)=0 : £(3)=0 


*Probabilities for the IFS; they must add to one! 
p(1)=0.25 : p(2)=.25 : p(3)=.5 


mag=1 ’Magnification factor. 


?Increase the number of iterations as you magnify. 
numits=5000 


factor =100 ’Scales pixel values to color values. 


This is the number of colors you are able to use. 
numcols=8 


for n=1 to numits ’Random iteration begins! 
r=rnd : k=1 ’Pick a number in [0,1] at random. 
if r > p(1) then k=2 

if r > p(i)+p(2) then k=3 

’Map k is picked with probability p(k). 
newx=a[k]*x + b[k]*y + e[k] 


newy=c[k]*x + d{k]*y + f[k] 


x=newx : y=newy 


i=int(mag*x) : j=int(mag*y) ’Scale by magnification factor. 


if (((i < 50) and (i>=0))and((0=<j) and (j<50))) then 


7If the scaled value is: 


s(i,j)=s(i,j)+i 
>. ..in the array add one to pixel (i,j). 


end if 
pset(i,j) ’Plot the point. 
if instat then end ’Stop if a key is pressed. 


next 
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for i=0 to 49 ’Normalize values in pixel array, and plot... 
for j=0 to 49 ’...in colors corresponding to the normalized... 


?,..values of the numbers s(i,j). 
col=s(i,j)*numcols*factor*mag*mag/numits 


Plot the pixel (i,j) in the color determined by... 
pset(i,j),col 


next j ’...its measure. 
next i 


end 


The program allows the user to zoom in on a piece of the rendered measure 
by altering the value of the magnification parameter mag. The result of running 
an adaptation of this program on a Masscomp workstation and then printing the 
contents of the graphics screen is shown in Figure [X.264. 

Rendered invariant measures for IFS’s acting in R* are also shown in Fig- 
ure [X.265. 

By carrying out some simple computergraphical experiments, using a program 
such as the one above, we discover that “pictures” of invariant measures of IFS’s 
possess a number of properties. (i) Once the viewing window and color assignments 
have been fixed, the image produced is stable with respect to the number of itera- 
tions, provided that the number of iterations is sufficiently large. (ji) Images vary 
consistently with respect to translation and rotation of the viewing window, and 
with respect to changes in resolution. In particular they vary consistently when they 
are magnified. (iii) The images depend continuously on the IFS code, including the 
probabilities. Property (i) ensures that the images are well defined. The properties in 
(ii) are also true for views of the real world seen through the viewfinder of a cam- 
era. Property (iii) means that images can be controlled interactively. These properties 
suggest that IFS theory is applicable to computer graphics. 

We should, if we have done our measure theory homework, understand the rea- 
sons for (i) and (ii). They are consequences of corresponding properties of Borel of 
measures on R?. Property (iii) follows from a theorem by Withers [Withers 1987]. 


Examples & Exercises 


8.1. Rewrite Program 1, section 8, in a form suitable for your own computer envi- 
ronment. Adjust numits and factor to ensure that a stable image results. Then make 
experiments to verify that the conditions (i)—(1i1) above are verified. For example, to 
test the consistency of images with respect to changes in resolution you should try 
mag = 0.5, 1, and 1.5. Unless you have a very powerful system, do not make ex- 
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Figure 1X.264. The 
result of running a mod- 
ified version of Program 
9.8.1 and then printing 
the contents of the graph- 
ics screen in gray tones. 
A rendered picture of a 
measure is the result. 


treme adjustments. For example, do not choose mag too small, otherwise you will 
need a very large value for numits. 


Applications of fractal geometry to computer graphics have been investigated 
by a number of authors including Mandelbrot [Mandelbrot 1982], Kawaguchi 
[Kawaguchi 1982], Oppenheimer [Oppenheimer 1986], Fournier et al. [Fournier 
1982], Smith [Smith 1984], Miller [Miller 1986], and Amburn et al. [Amburn 1986]. 
In all cases the focus has been on the modelling of natural objects and scenes. Both 
deterministic and random geometries have been used. The application of IFS theory 
to computer graphics was first reviewed in [Demk 85]. It provides a single frame- 
work that can reach an unlimited range of images. It is distinguished from other 
fractal approaches because it is the only one that uses measure theory. 

The modelling of natural scenes is an important area of computer graphics. Pho- 
tographs of natural scenes contain redundant information in the form of subtle pat- 
terns and variations. There are two characteristic features: (i) the presence of com- 
plex geometrical structure and distributions of color and brightness at many scales; 
and (ii) the hierarchical layout of objects. (i) Natural boundaries and textures are not 
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Figure 1X.265. Ren- 
dered invariant measures 
for IFS’s of two maps. 


(a) 


smoothed out under magnification; they preserve some degree of geometrical com- 
plexity. (ii) Natural scenes are organized in hierarchical structures. For example, a 
forest is made of trees; a tree is a collection of boughs and limbs along a trunk; on 
each branch there are clusters of leaves; and a single leaf is filled with veins and 
covered with fine hairs. It appears often in a natural scene that a recognizable en- 
tity is built up from numerous near repetitions of some smaller structure. These two 
observations can be integrated into systems for modelling images using IFS theory. 


Examples & Exercises 


8.2. Examine a good-quality color photograph of a natural scene, such as can be 
found in a Sierra Club calender, or an issue of National Geographic. Discuss the 
extent to which (i) and (ii) are true for that photograph. Be specific. 
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Figure IX.265. (b) 


In [Barnsley 1988a] it is reported that IFS theory can be used efficiently to model 
photographs of clouds, mountains, ferns, a field of sunflowers, a forest, seascapes 
and landscapes, a hat, the face of a girl, and a glaring arctic wolf. 

There are two parts to making any computer graphics image: geometrical mod- 
elling and rendering. Consider an architect making a computergraphical house: first 
she defines the dimensions of the floor, the roof, the windows, the shapes of the 
gables, and so on, to produce the geometrical model. Traditionally this is specified 
in terms of polygons, circles, and other classical geometrical objects that can be con- 
veniently input to the computer. This model is not a picture. To make a picture, the 
model must be projected into two dimensions from a certain point of view and dis- 
tance, discretized so that it can be represented with pixels, and finally rendered in 
colors on a display device. 

Here we describe briefly the software system designed by the author, Alan Sloan, 
and Laurie Reuter, which was used to produce the color images that accompany 
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this section. More details can be found in [Reuter 1987] and [Barnsley 1988a]. The 
system consists of two subsystems known as Collage and Seurat. Collage is used for 
geometrical modelling, while Seurat is used for rendering. 

Collage and Seurat process IFS structures of the form 


{Ror Ww), W2,..., Wi Pp Danes Pa 12, ..., NE 


where the maps are affine transformations in R?. An IFS is represented by a file that 
consists of an IFS code, where each coefficient is written with a fixed number of bits 

Let denote the invariant measure of such an IFS and let A denote the attractor. 
The pair (A, 2) is referred to as an underlying model. The attractor A carries the 
geometry while u carries the rendering information. One can think of the IFS code, 
or equivalently (A, jz), as being analagous to the plans of an architect. It corresponds 
to many different pictures. 

Collage is a geometrical modelling system used to determine the coefficients 
of the affine transformations w), w2,..., wy. It is based on the Collage Theorem. 
Seurat is a software system for rendering images starting from an IFS code. An 
image is produced once a viewing window, color table, and resolution have been 
specified. This is achieved using the Random Iteration Algorithm. Its mathematical 
basis is Elton’s Theorem. Seurat is also used in an interactive mode to determine the 
probabilities and color values. 

The input to Collage is a target image, which we denote here by 7. For example, 
T may be a polygonal approximation to a leaf. We suppose that 


T Cm=({(x,y)€R?:0<x<1,0<y<}}, 


and that the screen of the computer display device corresponds to m. T is rendered 
on the graphics workstation monitor. An affine transformation 


b 
wi(x,y)= & me = + = =A,x +t 


is introduced, with coefficients initialized at aj = d,; = 0.25, b} =c, =e; = f. The 
image w,(T) is displayed on the monitor in a different color from T. w,(T) is a 
quarter-sized copy of 7, centered closer to the point (0, 0). The user now interac- 
tively adjusts the coefficients with a mouse or some other interaction technique, so 
that the image w}(7) is variously translated, rotated, and sheared on the screen. The 
goal of the user is to transform w,(T) so that it lies over part of T. It is important that 
the dimensions of w,(7) are smaller than those of 7, to ensure that w, is a contrac- 
tion. Once w;(T) is suitably positioned, it is fixed, and a new subcopy of the target, 
w2(T), is introduced. w2 is adjusted until w2(T) covers a subset of those pixels in 
T that are not in w;(7). Overlap between w,(T) and w2(T) is allowed, but in gen- 
eral it should be made as small as possible, for efficiency. New maps are added and 
adjusted until Uj w j(T) is a good approximation to 7. The output from Collage is 
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the resulting IFS code. The probability p; is chosen proportional to |a;d; — bjc;| if 
this number is nonzero, and equal to a small positive number if the determinant of 
A; equals zero. 

The input to Seurat is one or more IFS codes generated by Collage. The view- 
ing window and the number of iterations are specified by the user. The measures of 
the pixels are computed. The resulting numbers are multiplied by the inverse of the 
maximum value so that all of them lie in [0, 1]. Colors are assigned to numbers in 
[0, 1] using a color assignment function. The default is a gray scale where the inten- 
sity 1S proportional to the number, such as 0 corresponds to black and 1 corresponds 
to brightest white. The coloring and texture of the image can be controlled through 
the probabilities and the color assignment function. Although one does not explic- 
itly use it, Theorem 9.6.3 lies in the background and can help in the adjustment of 
the probabilities. 

Color Plate 21 shows some smoking chimneys in a landscape. We obtained the 
IFS codes for the elements of this image we obtained using Collage. Different color 
assignment functions are associated to different elements in the image. The image 
was rendered using Seurat. 

The consistency of images with respect to changes in resolution is illustrated in 
Color Plate 22, which shows a zoom on one of the smokestacks in Color Plate 21. 
The number of iterations must be increased with magnification to keep the number 
of points landing within the viewing window constant. This requirement ensures the 
consistency of the textures in an image throughout the magnification process. 

Color Plates 23 and 24 show various renderings of leaves produced by Seurat. 

Color Plate 25 shows a sequence of frames taken from an IFS encoded movie en- 
titled A Cloud Study [Barnsley 1987]. The smooth transition from frame to frame is 
a consequence of the continuous dependence on parameters of the invariant measure 
of the IFS for the cloud. 

Color Plates 26, 27, and 28 were encoded from color photographs. Segmenta- 
tion according to color was performed on the originals to define textured pieces. IFS 
codes for these components were obtained using Collage. The IFS data base con- 
tained less than 180 maps for the Monterey seascape, and less than 160 maps for the 
Andes Indian girl. 

The two primitives, a leaf and a flower, in Color Plate 29 were used as condensa- 
tion sets in the picture Sunflower Field, Color Plate 30. Here we see the hierarchical 
structure: the leaf is itself the attractor of an IFS; and the flower is an overlay of 
four IFS attractors. The leaf is a condensation set for the IFS that generates all of 
the leaves. The flower is a condensation set to an IFS that generates many flowers, 
converging to the horizon. In the pictures Sunflower Field and Black Forest, shown 
in Color Plates 31-34, the primitives were displayed from back to front. The data 
bases for the Sunflower Field and Black Forest contain less than 100 and 120 maps, 
respectively. Notice the shadows behind the little trees in the background in Color 
Plate 32. The winter forest pictures were obtained by adjusting the color assignment 
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function. The important point is that once the adjustment has been made, the image 
and the zoom are consistent. 


Examples & Exercises 


8.3. Use the Collage Theorem to help you find an IFS code for a leaf. Adjust 
your version of Program 1 in section 8 to allow you to render images of associated 
invariant measures. Assign colors in the range from red through orange to green. 
Adjust the probabilities. Obtain a spectacular color picture of the leaf showing the 
veins. Make a color slide of the output. To photograph a picture on the screen of a 
computergraphics monitor, use a telephoto lens. Mount the camera on a tripod, and 
take the photograph in a darkened room, on Ectachrome 64 ASA color slide film, 0.1 
sec exposure, f-stop 5.6. For possible publication, submit the color slide, together 
with a letter of copyright assignment, to Michael Barnsley, Iterated Systems, Inc., 
5550-A Peachtree Parkway, Suite 650, Norcross GA 30092 USA. Include a self- 
addressed envelope. 


8.4. Obtain a very powerful computer with good graphics. Find the heirarchical 
IFS codes for the Sunflower Field. Replace the sunflowers by roses. Fly into your 
picture, to explore forever that scent-filled horizon. You are on your own. 


Chapter X 


Recurrent Iterated Function 


Systems 


1 Fractal Systems 


The goal of this chapter is to describe some general systems that can be used to con- 
struct deterministic fractals. We are concerned with the invention of mathematical 
machines for designing and controlling fractals. 

We use the name “recurrent iterated function systems” loosely to encompass the 
systems introduced in this chapter. Examples include “IFS’s” in which the applica- 
tion of transformations to points depends on the “addresses” of the points. Other 
examples include “IFS’s with probabilities” which have invariant measures that can 
be computed by means of algorithms of the “Chaos Game” type, wherein, instead 
of each transformation being applied with a fixed probability, there are different 
probabilities that depend on which transformation was applied previously. A unified 
framework for presenting IFS, recurrent IFS, and future developments in the theory 
of deterministic fractals is provided by the idea of fractal systems. 

In this section we introduce fractal systems, their associated fractals, and the 
objects or models which they can be used to approximate. We do this in such a 
way as to help the reader understand how to invent a fractal system appropriate to 
the type of model she or he is describing, without having first to develop a new 
theory from the bottom up. The ideas are loosely stated in order to convey most 
simply the underlying principles, which seem to be of general applicability and to 
be very powerful. From this author’s experience, it seems that any framework for a 
fractal system, once decided upon, can be readily fleshed out with the appropriate 
definitions and logical infrastructure to provide a full and clean theory. 

Roughly speaking, a fractal system consists of a means for generating determin- 
istic fractals, associated with an underlying space X, and a set or space of objects or 
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models, Y, which these fractals can be used to approximate. Throughout this chap- 
ter we use ( to denote a closed bounded subset of R? such as {(x, y) € R?:0<x, 
y <1}. 

Already in this book we have looked at several very different fractal systems, as 
described in the following examples. 


Example (i). One example of a fractal system is defined by IFS’s of contractive 
affine transformations acting on X = C R?: the underlying space is 0; the fractals 
of the system are attractors of all of the IFS’s; and the space of models, Y, which 
these fractals can be used to approximate, consists of all (compact) subsets of D, 
namely Y = H(X). In this example the models may represent monochrome pictures, 
with the sets representing the black parts of the pictures, and their complements 
representing the (background) white parts. The fractal system itself is a device for 
approximating the models by means of the fractals that it generates. 


Example (ii). Another example of a fractal system is defined by the IFS’s of 
contractive affine transformations on () C R?, with probabilities. The fractals of this 
system are the attractors of the associated Markov operators, as described in Chapter 
IX. They lie in the space Y = P(Q), the set of all normalized Borel measures on 
OC R2. Here the underlying space X is O C R?; its points do not belong to the 
fractals of the system because they are measures, not sets of points. In this example 
the models may represent grayscale pictures, with the amount of light reflected by a 
subset of the picture represented by the measure of that subset. Again, the fractal 
system provides a means for approximating the models, using the fractals that it 
generates. 


Example (iii). A third example of a fractal system involves the space X = 
[0, 1] C R?, with Y = C[O0, 1], the continuous real valued functions on [0, 1]. In this 
system the fractals are the fractal interpolation functions on [0, 1] that can be gener- 
ated by affine transformations, as described in Chapter VI; and the the models may 
represent time series data or stock market prices. 


Further examples of fractal systems are introduced in this chapter, and involve 
recurrent iterated function systems. Yet others, involving local iterated function sys- 
tems and appropriate to the modelling of real world images, are described in [BH].' 
All of these fractal systems use the same basic ingredients, which we describe next. 


(1) An underlying metric space (X, d) needed to define the fractals and the mod- 
els in the system. For example, X may be R’, R°, or a subset of one of these 
spaces, such as) C R?. Typically (X, d) is complete, and bounded subsets of 
it are compact. 

(2) A space of models Y = Y(X). Each point of Y is a model, and models are de- 
fined with the aid of the space X. The fractals generated by the fractal system 


''M. F. Barnsley and L. P. Hurd, Fractal Image Compression, (1993), A. K. Peters, Welles- 
ley, Massachusetts. 


1 Fractal Systems 


also belong to Y. Examples of Y include spaces of sets, function spaces, and 
measure spaces. We also need a metric h on the space Y, such that (Y, h) is a 
complete metric space. Examples of (Y, ) include (H(X), the Hausdorff dis- 
tance generated by d), (C(O, 1], h(f, g) = max{| f(x) — g(x)|: x € [0, 1]}), 
and (P(X), d),) where d;, is the Hutchinson distance between measures. 

(3) A contractive operator O which acts on the space (Y, h). That is, the operator 
O is such that there exists a real number s with 0 < s < 1, and 


h(O@), O(y)) ss -h@, ~) forall g, p € Y. 


Typically, the operator O is constructed with the aid of elementary contractive 
functions that act upon the underlying space X. 


For the examples of fractal systems, (i), (ii), and (iii) begun above, the operators 
O are set up as follows: 


Example (i) (continued). X =o Cc R’, w; :G > Dis a contractive transfor- 
mation for eachi = 1,2,...,N, Y = H(X), and O: Y — Y is defined by 


O(Y) = W(Y) =|_J{wi(x) : x € Y}. 


Remark that O is built up using contractive transformations acting on the underlying 
space. 


Example (ii) (continued). X =O Cc R’, {X; w1,..., wy; pi,.--, pw} are 
IFS’s with probabilities, as in section 6 in Chapter [X, with the w,’s as in (i) above, 
and Y = P(X); we can then define a corresponding fractal system by taking O to 
be the associated Markov operator as defined in section 6 in Chapter IX; that is, 
O:Y — Y is defined by 


O(v) = M(v) = pivow,' + povow;! +:+-+ pyvowy’. 


We know from the theory of Chapter IX that O is contractive in the Hutchinson 
metric. Again, remark that O is built up using contractive transformations, the wj’s, 
acting on the underlying space. 


Example (iii) (continued). In this example fractal interpolation functions, as 
described in Chapter VI, are used. X = [0, 1] C R, and Y= F = C(O, 1]. Then O: 
Y — Y is the operator T : F — F defined in the proof of Theorem 2.2 in Chapter 
VI. In the notation of section 2 in Chapter VI, this is, for f € F, 


O(f) = (TF)(%) = eal (®) + da f G'(@)) + Si 
for x 61%... 4,4, |, torn = 1... .N: 
Then O is a contraction mapping with respect to the supremum distance on C[0, 1]. 
Again, notice how the operator O is built up from contractive transformations acting 
on the underlying space. 
In practice, the contractive operator O is constructed from finite collections of 
contractive affine transformations; for example, affine transformations acting in two 
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dimensions. Furthermore, the coefficients of these constituent functions can be re- 
stricted to lie in finite sets, defined for example by rounding. Then O itself can be 
described by finite sets of discrete coefficients, the full set of which represents the 
“code” for the operator O. 

The consequences of having the ingredients 1, 2, and 3 can be summarized in the 
following Theorems and Expections: 


Theorem 1.1 Existence of Aftractors. Since O is contractive, and the metric 
space Y is complete, there exists an unique attractor  € Y such that 


Oo) =¢ 


Proof See Theorem 6.1 in Chapter III. 


Expectation 1.1 Fractal Character of Affractors. We anticipate that 
@ is a fractal: that is, we expect that $ has a resolution-independent, infinitely- 
magnifiable character. This is because of the contractivity of the functions from 
which O is constructed: the whole invariant set is the same as a sum or union of 
contractions applied to it, and thus it is made of shrunken copies of (parts of) itself. 
Depending on the way in which the contractions act, the focus may be on contrac- 
tivity of various spatial dimensions, and/or measure theoretic contractivity, and we 
expect that the attractor $ will inherit corresponding fractal characteristics. 


Theorem 1.2 Computation of Attractors. To compute $, we can use the 
fact that if w € Y then the result of repeatedly applying O to w converges to the 
attractor $; that is, 

lim O°'(w) = @. 
n—> oo 
Moreover, if there exists a real constant C such that h(9,, $2) < C for all $1, $2 € Y, 


then we have the error estimate 


h(O"(v), 6) <s"C. 


Proof See Theorem 6.1 in Chapter III. 

The latter equation tells us that the fixed point, or attractor, can be computed 
by algorithms of the “Photocopy Machine” type. The error estimate allows one to 
predict the number of iterations required to achieve a given accuracy. 

Theorem 1.3 General Collage Theorem Estimate. The distance between 
w € Y and the attractor $ of O is bounded by the estimate 
h(y, O()) 


h(¢, vv) < 
@,v)< ac 
Proof See Lemma 11.1 in Chapter III. 


The set O(y) is called a collage of w, while the distance h(y, O(y)) is called the 
corresponding collage error. The theorem says that if we wish to find an operator 
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O whose attractor @ is approximately y, then we have only to solve the problem 
of choosing O such that application of O to y does not change w very much, as 
discussed in Chapters III and IX. 

This latter simple prescription is of central importance, yet easy to miss. We 
repeat it in slightly different terms. Suppose that we have a fractal system, defined by 
Y(X) and O, where © is a set of operators O as above. We seek an operator O € O 
whose attractor @ approximates y € Y. To do so, we adjust O so that the collage 
error h(y, O(y)) is as small as possible. Then O defines our fractal approximation 
@. To store @ it suffices to store O. In the end, one thinks of the controllable family 
of operators © as the fractal system. 

Whenever we have the ingredients 1, 2, and 3, we have a fractal system. A frac- 
tal system is a mechanism for finding fractal approximations @ for models w € Y. 
Examples (i), (ii), and (iii) above constitute interactively controllable systems for 
fractal modelling applied to diverse problems: in (i), sets of points are approximated 
by fractals made of sets of points; in (ii) measures are approximated by fractal mea- 
sures; and in (iii), continuous functions are approximated by fractals that are graphs 
of continuous functions. In each case the parameters that control the problem are 
embedded in the operator O, which is selected and tailored to produce an appara- 
tus for fractal approximation, a fractal system, appropriate to the modelling problem 
under consideration. 

We note that fractal systems provide deterministic fractals rather than stochastic 
fractals. Once the operator O has been selected, its attractor, the fractal that it de- 
fines, has been fixed once and for all. There is only one fractal associated with each 
operator. These fractals are typically low information content members of a larger 
space, the space of models in which they lie; in much the same way as the rational 
numbers are finite information content members of the information rich space of all 
real numbers. 

In the following sections we develop more elaborate fractal systems, based on 
recurrent IFS. Their introduction was begun in section | of this chapter. These gen- 
eralize earlier systems in an elegant manner. They can be used to design machines 
to do elaborate fractal modelling, and illustrate well the design principles enunciated 
above. 


2 Recurrent iterated Function Systems 


Recurrent iterated function systems were introduced in [BEH],” where more infor- 
mation can be found. 


2M. Barnsley, J. Elton, D. Hardin, Recurrent Iterated Function Systems, Constructive 
Approximation (1989) 5, 3-31. 
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Table X.1. Example of a recurrent IFS code. Not all transitions are possible! 
wi ae Bh ee Wi Ais Tg 


0.0. dis 0 0:03... O.)2 dD 
22: ¢ AL... Gi O..2128 0 0.6 0.4 
0.0 Ls 2th. ten... 09.0 0.5 


Definition 2.1 A recurrent iterated function system consists of an IFS {X; wy, 
W2,..., Wy} together with a matrix {p;; € [0, 1]:1, j =1,2,..., N}, such that (i) 


Pi + Piz + Pix +---+ Pin =1 fori=1,2,...,N 


and (ii) such that for any i and j, there exists a finite sequence of integers k,l,..., 
m € {1,2,..., N} so that 


PikPki*** Pmj > 0 


The transition probability for a certain discrete time Markov process is p;; , which 
gives the chance, or probability, of moving from state 7 to state j, given that the 
process is in state 1. Condition (i) says that the process is row stochastic: whichever 
state the system is in, a set of probabilities is available that sum to 1, and that 
describe the possible subsequent states to which the system can transition at the 
next step. Condition (11) says that if the system is in state 7 then there is a finite 
probability of reaching state j in a finite number of steps, for any pair of integers 
pe PQ 

An IFS with probabilities provides a simple example of a recurrent IFS. The IFS 
with probabilities 


{X; w,, W2,..., WN; Pi, P2,--+5 Pn} 


is in many ways the same as the recurrent IFS 
{X; wi, w2,..., wy; (pij)} 


when p;; = p; for alli, j € {1,2,..., N}. 

We will say that a recurrent IFS is hyperbolic if the associated IFS is hyperbolic, 
that is, contractive with contractivity factor 0 < s < 1. We restrict attention to hyper- 
bolic recurrent IFS. 

Examples of codes for recurrent IFS’s are presented in Tables X.1 and X.2. In 
each case the space is X = R?, the transformations are affine, and we use the usual 


e i d; y sf ; : 


Figures X.266 and X.267 provide diagrams of the associated Markov processes. 
We say that the system is in state i if the last transformation to be applied was w;. 
For the the recurrent IFS in Table X.1 it is never possible to apply transformation 
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Table X.2. Example of a recurrent IFS code. All transitions can occur! 


wi ed; e; i a Pe TS 
1 +. 4 G3 0 0.33 ..65...03 
os. =). 0.05 378-0 UM. BS 4 

3 -@5 0 .0- 023 3 bs 64 024 G2 


Figure X.266. Markov 
process for the recurrent 
IFS given in Table X.1. 
There is no path from state 
1 to state 3, state 2 to state 
1, or state 3 to state 2. 


Figure X.267. Markov 
process for the recurrent 
IFS given in Table X.2. 
Paths exist between all 
possible states. 


number 3 if the system is in state 1. It is also not possible to make a transition from 
state 2 to state 1. For the recurrent IFS in Table X.2, any transformation can follow 
any other one; that is, the system can transition from any state to any other state, in 
one step. 

A recurrent IFS possesses a unique measure theoretic attractor, [BEH]. This is an 
invariant measure that can be computed by means of a generalized “Chaos Game” 
algorithm. It can also be computed using a wierd “Photocopy Machine.” 


Example of the Chaos Game Algorithm for a Recurrent IFS. We 
present this algorithm for the case of the the recurrent IFS codes in Tables X.1 and 
X.2. The algorithm yields an orbit {(x,, y,):n =0, 1,2, ...} that, with probability 
one, describes the unique invariant measure associated with the recurrent IFS. The 
relationship between the orbit and the measure is much the same as that described in 
Chapter IX for the case of standard IFS with probabilities. 


(1) Choose a starting point (xo, yo) € R’. It is desirable that the starting point lies 
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as close as possible to the attractor. Clearly this lies in [0, 256] x [0, 256]. 
Thus, we might choose (xo, yo) = (0, 0). 

(2) Choose a initial state so = {1, 2, 3}. Any other state in the set {1, 2, 3} could 
be used. The Markov process associated with the transition matrix (p;;) pos- 
sesses a unique stationary vector (m1, m2, m3) with m; > 0, such that 


3 
> mipy =mj, jai, 2:3. 
i=l 
If one knows this vector, then it is a good idea to choose the initial state to be 
the one corresponding to the largest m ;. 
(3) Select s; € {1, 2, 3} with probability p,,; associated to the choice 5; = j. 
(4) Compute (x1, yi) = Ws, (Xo, Yo)- 
(5) Select s2 € {1, 2, 3} with probability p,, ; associated to the choice s2 = j. 
(6) Compute (x2, y2) = Ws,(%1, yi). 


(2n + 1) Select s, € {1,2,3} with probability p,, ,; associated to the choice 
Sy: 


(2n + 2) Compute (xn, yn) = Ws,(Xn-1, Yn—1)- 


The result of the above calculation is supposed to be a long orbit 
Q(numits) = {(Xn, yn) :n = 1, 2, .., numits}. 


(With probability one) the value of the invariant measure « € P((0, 256] x [0, 256]) 
of the recurrent IFS is given by the formula 


number of points in {S M Q(numits)} 
lim p(s) = ——— ec —— 
numits—> oo numits 


for any measureable subset S$ C R?. 


In Figure X.268 we illustrate this algorithm for the case of the recurrent IFS in 
Table X.1. In Figure X.269 we give the results of a computation, showing a long 
orbit. The invariant measure can be pictured by observing the density of the points. 

In Figure X.270 we illustrate this algorithm for the case of the recurrent IFS in 
Table X.2. In Figure X.271 we give the results of a computation, showing a long 
orbit. Again, one gets some idea of the invariant measure by looking at the density 
of the points. 

In Figure X.272 we show the structure and the attractor for the recurrent IFS in 
Table X.3. This involves four transformations which take [ into itself. The range of 
w; is the quadrant of ZF labeled i, so that the transformations provide a just-touching 
collage of the square. The transition probability p24 is zero, which means that no 
points on the attractor lie in the subquadrant labeled 24. 
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Figure X.268. Chaos 
Game for a recurrent IFS 
in which not all transitions 
are possible. The recurrent 
IFS code is given in Table 
X.1. 


IN STATE 1 || INSTATE 2 || INSTATE 3 


Jump to State 1 || Jump to State 2 |} Jump to State 3 
with Prob 0.3 with Prob 0.6 with Prob 0.5 

Jump to State 2 || Jump to State 3 || Jump to State 1 
with Prob 0.7 with Prob 0.4 with Prob 0.5 


—— @. (Xe Yves) 


The measure of S 
' is the proportion 
XV of points which 

—s X land in S 


In the (x,y) plane, when the system jumps to State j, apply the map 
w; to the current point (x,,y,,) to obtain (Xy44YnNe4) = Wj (Xap) 


Thus produce a dancing point. 


Figure X.273 shows the attractor for the maps in Table X.3, for six different sets 
of transition probabilities. These sets of transition probabilites used were: 


(1) pi; = 0.3, pi2 = 0.5, pi3 = 90.1, pig = 0.1, 
pr = 0.1, pr2 = 0.5, p23 = 0.4, prs = 0.0, 
p31 = 0.3, p32 = 0.3, p33 = 0.2, p34 = 0.2, 
Pai = 0.25, Pao. = 0.25, P43 = G25. P44 = 0.25; 
(2) pir = 0.3, pi2 = 0.5, pi3 = 0.1, pig = 0.1, 
p21 = 0.1, po = 0.5, p23 = 0.4, pr4 = 0.0, 
p31 = 0.3, p32 = 0.3, p33 = 0.2, p34 = 0.2, 
pai = 0.25, pag = 0.25, p43 = 0.5, Pag = 0.0; 
(3) pu =0.3, pi2 = 0.5, pi3 = 0.1, pig = 0.1, 
p21 = 0.1, pox = 0.5, p23 = 0.4, pr4 = 0.0, 
p31 = 0.0, p32 = 0.3, p33 = 0.5, p34 = 0.2, 
Pa = 0.0, P42 = 0.5, P43 = 0.5, P44 = 0.0. 
(4) py; = 0.0, pi2 = 0.5, pi3 = 0.2, pig = 0.3, 
P21 = 0.1, por = 0.4, pr3 = 0.4, prg = 0.1, 
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Figure X.269. Some Y 
points on the orbit pro- 

duced by the machinery in 256 
Figure X.268. 


p3, = 0.25, p32 = 0.25, p33 = 0.25, p34 = 0.25, 
Pai = 0.4, paz = 0.4, p43 = 90.1, pag = 0.1. 
(5) piy = 0.25, pi2z = 0.25, p33 = 0.25, pyg = 0.25, 
| P21 = 9.25, por = 0.25, p23 = 0.25, po4 = 0.25, 
p31 = 9.25, p32 = 0.25, p33 = 0.25, p34 = 0.25, 
pai = 0.25, paz = 0.25, paz = 0.25, pag = 0.25. 
(6) pi: = 0.0, pi2 = 0.34, pi3 = 0.33, pig = 0.33, 
P21 = 9.25, p22 = 0.25, p23 = 0.25, po4 = 0.25, 
ps: =0.25, paz = 0.25, Pa = O20, Pag = US, 
Pa; = 0.25, pag = 90.25, paz =0:25, pag = 0.25. 


Can you work out which set of the above transition probabilities corresponds to each 
of the images in Figure X.273? 


Example of the Photocopy Machine Algorithm for realizing the in- 
variant measure of a recurrent IFS. We illustrate this algorithm for the case of 
the IFS code in Table X.1. It is illustrated in Figure X.274. 

The mathematical machine in Figure X.274 corresponds to the recurrent IFS 
in Table X.1, wherein there are three affine transformations w,;, w2, and w3, and 
wherein the transition probabilities p,3, p21, and p32 are all zero. The states of 
the system are represented by the three copy units, one of which is labeled IN- 
PUT1/COPY 1; the others are labeled INPUT2/COPY2 and INPUT3/COPY3. Each 
unit has two lens/filter components. The first copy unit is made of two lens/filter 
components; one of these components applies affine transformation w, to INPUT1, 
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Figure X.270. Ran- 
dom walk generator as- 
sociated with the recurrent 
IFS in Table X.2. 


IN STATE 2 || IN STATE 3 


IN STATE 1 


Jump to State 1 
with Prob 0.3 


Jump to State 1 || Jump to State 1 
with Prob 0.1 with Prob 0.4 
Jump to State 2 || Jump to State 2 || Jump to State 2 
with Prob 0.6 with Prob 0.5 with Prob 0.4 
Jump to State 3 || Jump to State 3 || Jump to State 3 
with Prob 0.1 with Prob 0.4 


with Prob 0.2 


Y 


X 


Apply map w, when system jumps to State j. 


attenuates the brightness of the image by factor pj;, and directs its output to COPY 1; 
the other of these components applies affine transformation w2 to the input im- 
age, attenuates the brightness of the image by a factor p;2, and directs its output 
to COPY2. The second copy unit has two lens/filter components; the first applies 
affine transformation w2 to the input image INPUT2, attenuates the brightness of 
the image by factor p22, and directs its output to COPY2; the other of these com- 
ponents applies affine transformation w3 to the input image INPUT2, attenuates the 
brightness of the image by a factor p23, and directs its output to COPY3. The third 
copy unit has two lens/filter components; the first applies affine transformation w3 to 
the input image INPUT3, attenuates the brightness of the image by factor p33, and 
directs its output to COPY3; the other of these components applies affine transfor- 
mation w} to the input image INPUT3, attenuates the brightness of the image by a 
factor p31, and directs its output to COPY 1. 

It is important to note that the three inputs and the three outputs are set up so that 
they all represent the same subset 0 of R?. The range of transformation w lies in 
COPY 1, the range of w2 lies in COPY2, and the range of w3 lies in COPY3. Thus, 
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Figure X.271. “Pic- 
ture” of the point-set at- 
tractor corresponding to 
the fractal system repre- 
sented in Figure X.270. 


0 256 y 


Table X.3. Recurrent IFS code. See Figure X.272. 
Ce CF Si P; i> fs fa 


& 
cS 
> 


+ to) 8 05 0 O Abs. US 62 0.1 
2° Soe 0: 1282 0.1 - -05;-. 0.4~ = 00 
3. 84... 9 - 0 5-42] 2 tS Oe ee 
a. GS 0-0 Gis O- 02 025 S22... 45 


in the first copy unit transformation w2:(— C in fact takes INPUT1 to COPY2, 
whilst in the second copy unit it takes INPUT2 to COPY2. 

The images that are projected onto the copy screens are superposed; places where 
images come in from several inputs add up. For example, the total brightness of the 
image in COPY 1 is pj; times the total brightness of the image in INPUT], plus p3; 
times the total brightness of the image in INPUT3. 

Copy images are recorded on “photographic paper” and put back on their respec- 
tive inputs, as indicated in the figure. The system is cycled numerous times until the 
output images cease to change; then symbolically 


OUTPUT] = p1; -OUTPUT1 + p3; -OUTPUT3 


OUTPUT2 = p12 - OUTPUT1 + px - OUTPUT2 
OUTPUT3 = p23 - OUTPUT2 + p33 - OUTPUTS. 
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Figure X.272. Four 
affine transformations 
provide a just-touching 
collage of (J. A recurrent 
IFS (Table X.3) made of 
these transformations is 
such that pong = 0. As a 
consequence, the attractor 
has no points in the square 
whose address is 24. 


Finally, the images OUTPUT1, OUTPUT2 and OUTPUTS3 are superposed. Symbol- 
ically, 


ATTRACTOR = OUTPUT1 + OUTPUT2 + OUTPUT3. 
Color Plate 37 illustrates a measure theoretic attractor of a related Photocopy 


Machine. Color Plate 40 illustrates another way of approaching the computation of 
the invariant measure of a recurrent IFS, using cameras and screens. 
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Figure X.273. All of 
these attractors correspond 
to different choices for the 
transition probabilities in 
Table X.3. 


3 Collage Theorem for Recurrent IFS 


In this section we are concerned with the construction of fractal systems, based on 
recurrent IFS, with an emphasis on point-set attractors, that is, attractors that are sets 
of points rather than measures. Our goal is to describe how a collage theorem and 
associated machinery can be set up to enable fractal modelling using the point-set 
attractors of certain recurrent IFS. 

A recurrent IFS can described in terms of a Markov chain which acts on a code 
space built from the symbols {1,2,..., N}, as illustrated in Figures X.268 and 
X.270. The numbers p;; = 0, Se pij = 1, give the probabilities of transfer among 
the symbols. One can imagine a particle moving from symbol to symbol following 
the discrete time Markov process. The process is defined, strictly speaking, to be 
recurrent if there is a finite probability of being able to move, on the directed graph, 
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\ 
\ 


EWR = 


Copies 1, 2 and 3 are combined (superposed) to create Final Image 


from any given symbol to any given symbol. A good source of information on 
Markov chains is [Feller, Ch. 15].° 
We are concerned with the hyperbolic case, namely, 


d(w;(x), wi(y)) < sd(x, y) Vi, Wx, yEX 


where d is the distance function on X and 0 <s < 1. In this case, we know that 
there exists a unique attractive invariant probability measure 4, which describes the 
random walk on X produced for example by the “Chaos Game” algorithm described 
in the previous section. Our focus in this section is on the structure of the support 
of 4, which we call A ¢€ X, the attractor of the recurrent IFS. This depends only on 
which p;; are nonzero, the connection structure of the Markov chain, and otherwise 
not on the values of the p;;’s. 

We begin by recalling how the analysis proceeds in the case of IFS theory; this 
provides the simplest framework for explaining the recurrent case. Let (X, d) denote 
a compact metric space X, with distance function d. Let 7 denote the set of all 
nonempty compact subsets of X. 


Definition 3.1 d(x, B) = minyeg d(x, y) Vx € X, VB € H. Note that 
BCC =d(x,C) <d(x, B). (1) 


3'W. Feller, An Introduction to Probability Theory and its Applications (Wiley, London), 
1957. 


Figure X.274. Photo- 
copy Machine for realiz- 
ing the invariant measure 
of the recurrent IFS in 
Table X.1. 
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Definition 3.2 d(A, B) = max,<,4d(x, B) VA, BE H. 
Note that this set distance is not symmetric. It has the following properties 


(i) BC C=>d(A, C) < d(A, B). This follows at once from (1). 
(ii) d(A UB, C) =d(A, C) Vv d(B, C), where x V y = max{x, y}. This follows 
from the observation that 


d(AUB,C)= max d(x, Cc) = max d(x,C)V max d(x). 


Definition 3.3 For all A, B € H, the Hausdorff distance is defined by 
-h(A, B) =d(A, B) Vv d(B, A). 
We recall that (H, h) is a compact metric space. 
Lemma 3.1 For all A, B,C, DEH, 
h(A UB, CUD) <h(A,C) Vv h(B, D). 


Proof 
d(AU B,C UD)=d(A,CUD)Vd(B,C UD) by (ii) 
<d(A,C)Vvd(B,D) by (i) 
<h(A, C) v h(B, D). 


The same argument yields d(C U D, AU B) < d(C, A) V d(B, D) < h(A,C) Vv 
h(B.D). 


Let {X; wj, j =1,2,..., N} be a hyperbolic IFS, with d(wjx, w;y) < sd(x, y) 
Vx, yéX, and0 <s < 1. Define 


W:H-H 
by 


N 
W(A) =|_J wj(A). 


j=l 


Theorem 3.1 W :#H — H is a contraction, with contractivity factor s, with 
respect to the Hausdorff metric; that is, 


h(W(A), W(B)) <sh(A, B) VA, BeH. 


Proof For any A, B € H, 
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N N 
h(W(A), W(B)) =h (U wj(A), (_) w(@) 


faz] j=! 


2 


< \/ h(w;(A), w;(B)) (by Lemma 3.1) 


j=1 


= VV {4 (wy(A), wj(B)) vd (wy(B), w/(A))] 


4 


< \/ {sd(A, B) v sd(B, A)} 


j=l 


= sh(A, B) 


We leave it as an exercise for the reader to show that the following extension of 
the above result is true. We will need this fact later on. If (X,, d;) and (X>, d2) are 
metric spaces, then (71;, h,) and (72, h2) are the corresponding spaces of compact 
nonempty subsets, if 


wj;:X,—> X2 for j =1,2,...,N, 


obeys 
d(wjx, wjy) <sd\(x,y) Wx, yeX, 
then 
W : Hy > AH 
defined by 
N 
W(A) =|) w,(A) 

j=l 

obeys 


h(W(A), W(B)) < sh\(A, B). 


Corollary 3.1 There is a unique set A € ‘H such that W(A) =A 
A is the attractor of the IFS. 
Corollary 3.2 Collage Theorem for IFS. /f B < H obeys 
h(B, W(B)) <e>0 
then 
ie 2 
(es) 
where A denotes the attractor of the IFS. 
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Proof By the contraction mapping theorem 
h(A, B)=h(B, lim W°"(B)) = lim h (B, W°"(B)) 
n—> co n—->0OoO 
where W°°(B) = W(B) and we inductively define 
wet) (B)=wW(W"(B)), n=0,1,2,... 


But, by the triangle inequality, 


h(B, W°"(B)) < > h(W™"Y(B), W"(B)) 


m=1 


=) hive Yew e))) 


m=1 


<)> s™'h(B, W(B)) 


mt 


<(1—s)'h(B, W(B)) 


We are now ready to extend the structure to the case of recurrent IFS. We actually 
make a generalization of the recurrent IFS structure to multiple spaces and set maps, 
suitable for the hyperbolic case where one is concerned with point-set topology is- 
sues. We are only concerned here with the connection structure of the chain. Let 
(X;, d;) be compact metric spaces, j € {1,2,..., N}. Let (H;, h;) denote the asso- 
ciated metric spaces of non-empty compact subsets which use the Hausdorff metrics. 
Let there be defined maps W;; : 7H; > HiV(i, j) € J where [ is some set of pairs of 
indices with the property that for each i € {1,2,..., N} there isa j € {1,2,..., N} 
with (i, j) € J. That is, /(i) = {j|@, j) € 1} 4 @ for eachi € {1,2,..., N}. Further- 
more let 


h; (Wi;(A), Wij(B)) < sijhj(A, B) 


for some number s;;, V(i, j) € 1, VA, Be H,. By the remark following Theorem 3.1 
such maps can be built up from point maps taking X; to X;. 

Suppose w; : X — X are contractive maps, where X is compact metric, and (pj;) 
is row stochastic. Define (X;, d;) = (X, d) for each j, and define W;;(S) = {w;(x) : 
xéS}, i,j=1,...,N. Let 1(@i) = {j: pj > 0}. This embeds us in the more gen- 
eral setup that we now study. 

Let 


H=H; x He x Pax see x Hy, 
and endow H with the metric h defined by 
h((A1, Ao, ---, Aw), (Bi, Bo,..., Bw)) = max {hj(Aj, Bj)|j =1,2,...,N}. 


Then it is readily demonstrated by the enthusiatic reader that (H, h) is a compact 
metric space. 


3 Collage Theorem for Recurrent IFS 


We think of 1 as consisting of a stack of clipped planes X,, X2,..., Xv witha 
point in 1 being the N-tuple of one image in each plane. This idea is illustrated in 
Color Plate 41. Define 


~ 


W:H-H 
by 


W(Aj, Az, ..., Av) = ( U wit), UJ wi)... U vwi(A)) 


jel (i) jel (2) Jéel(N) 


For example, in the case N = 2 such a mapping might be symbolized 


#(4)=( 0 ) (4) =f Wi2(A2) ) 
A2 Wr W22) \ Ao W21(A1) U W22(A2) 

We have now arrived at our goal; we are able to characterize W as a contraction 
mapping on 1; we thus have all of the ingredients of a fractal system. 


Theorem 3.2 W:H— H obeys 
h (WiA), w(B)) <sh(A, B) VA, BEeH. 
where s = max{s;;, (i, j) € I}. 
Proof To keep the notation succinct we suppose 
~ Wi Wie 
Ww 
( Wr Wa ) 
Then if A = (A, Az) and B = (B;, B2) we have 
h (W°A), w(B)) 
= h ((W11(A1) U Wyi2(A2), W21(A1) U W22(A2)) , 
(W11(B,) U Wi2(B2), W21(Bi) U W22(B2))) 
= max{h,(W,1(A1) U Wj2(A2), Wi1(B1) U Wi2(B2)), 
h2(W21(A1) U W22(A2), W21(Bi) U W22(B2))} 
< max{h,(W11(A1), Wii (B1)) V 41(Wi2(A2), Wi2(B2)), 
h2(W21(A1), Woi(Bi)) V h2(W22(A2), W22(B2))} (by Lemma 3.1) 
< max{s1;h;(Aj, Bi) V 512h2(A2, Bz), 52141(A}, By) V 522h2(A2, Bo)} 
<s(hy(Ay, By) V hp(A2, B2)) = sh((Aj, Az), (By, B2)) 
= sh(A, B). 


The fact that W is a contraction mapping on H means that we have a fractal 
system as described in the first section of this chapter. Conditions 1, 2, and 3, of 
section 1 apply. In this case the underlying space is X. Multiple copies of X are 
used to make up a space of models Y = H. (It may be that we only care about one 
“plane” in H.) A contraction mapping O = W on H is built up out of elementary 
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transformations acting on X. Fixed points of such operators O provide the fractal 
approximations to the models. In the present case the specific structure is as follows. 


Corollary 3.3 When s < | there is a unique element 
A=(Aj, A2,..., Aw) €H 
such that 
Aj = Ujena Wij(Aj) fori =1,2,...,N. 


That is to say, W(A) = A. A is the attractor of the recurrent IFS. 

Corollary 3.4 A Collage Theorem for recurrent IFS. [f B < H obeys 

h (B, W(B)) <€>0 
then 
n(B, AS — 
9 os (1 oes s) >] 

where A denotes the attractor of the recurrent IFS. 

To connect this with the original single-space point-map recurrent IFS, we have 


Corollary 3.5 Let {X; w;, (pij),i, j =1,...,N} be @ recurrent IFS with X 
compact and the w;’s uniform contractions. Let A be the support of the unique sta- 
tionary measure 4. mentioned in section 2. Then there exist unique compact sets 
A:C A, i=1,...,NwihaA= UN _,Ai such that 


Aj = Uj:p;,>0Wi(A;), i= fee. = 


In terms of the random walk produced by the Chaos Game Algorithm, as illus- 
trated in section 2, the A;’s may be characterized as follows: for all x, x € A; iff for 
every neighborhood G of x, for almost all trajectories xp, w;,(Xo), Wi, (Wi, (x0), .- +; 
we have i, =i and w;,(...(w;,xo)...) € G for infinitely many n. In other words, to 
“see” Aj, just look at the points along a trajectory that end in map uj. 

Let us use the above ideas to construct a simple machine (fractal system) for 
modelling some binary subsets of 0. The machine is illustrated in Figure X.275. It 
uses affine transformations w; :O > 0, i = 1, 2, 3. Place a copy of the subset T CO 
on (1), and place adjustable subsets U, V C Don (2) and (3) respectively. Adjust the 
coefficients a;, b;, ci, d;, e;, f; and the two subsets U and V, so that 

T ~w,(T) Uw2(U) Uw3(V) 

U~w,(T)Uw2(U) 

Vrwu(T)Uw3(V) 
Assuming that the system can be adjusted so that these equations hold approxi- 
mately, while the transformations remain suitably contractive, we know that the at- 
tractor for the system, our fractal model, which will be represented by a vector of 
sets (7, U, V), will be such that T ~ T. This simple machine generates an 18 pa- 
rameter family of fractal models for subsets of CO. 


Fractal Transformations 


This collection of color plates tells a story. After writing Fractals Everywhere 
(1988) I worked for ten years with Alan Sloan, John Elton, Stephen Demko and 
others, to develop successfully a method for compressing digital photographs 
using local IFS theory; the resulting technology was used by Microsoft. But the 
pictures in this section were produced during the last seven years, and relate to 
a fascinating direction for research, namely transformations between fractals. In 
the spirit of The Fractal Geometry of Nature (1981) by Benoit B. Mandelbrot, 
these pictures provide further hints concerning connections between fractal 
mathematics, pictures of nature, and art. 
New technical terms are written in italics where they are first introduced. 
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Plate 46 


This plate illustrates two superimposed IFS attractors. The colors come from 
color-stealing, a recently developed IFS technique that you can read about at www. 
superfractals.com. Each IFS comprises three simple geometrical transformations: 
can you find them? But the subjective point of the picture is this: it may connect 
the viewer to ideas of butterflies, flowers, angels and art deco line drawings. 


Plate 47 


Top: the top of an overlapping IFS attractor surrounded by colored contours 
at constant distances from it. The IFS has four affine maps: but copies of the 
attractor are laid down one on top of another in a particular order and rendered 
in different colors. Then the resulting colored attractor is again acted on by the 
IFS, the four copies again being laid down on top of one another in the same 
particular order, each color being replaced by four new colors. This process 1s 
iterated and might yield an inchoate image: however, because of the way the 
colors are stolen from digital photograph of nature, the result is quite continuous. 
This picture results from merging fractal geometry with the geometry of a digital 
photograph of nature. Both sources inform the resulting picture. Bottom: shows 
a close up on a fractal fern surrounded by contours of constant distance. Imagine 
how the contours look as you get closer and closer to the fronds. Note that a 
computational effect has roughened some of the contours. 


Plate 48 


Just-touching attractor of an IFS of three bi-affine functions, computed 
using the chaos game algorithm and rendered using color-stealing. Bi-affine 
functions are affine in two directions through each point in space. The colors 
were stolen from a digital photograph of autumn leaves lying on the ground. 
Note that the picture shows all of the attractor —it has three square corners. 
Try to see how the shape of this picture is the union of three transformed 
copies of itself. Note the biological character of the small structures within 
the image and the interplay of soft colors. 


Plate 49 


Top right, bottom left and bottom right: attractors of just-touching IFSs of three 
or four bi-affine functions, rendered using color-stealing. Top left shows a close-up 
of the attractor at top right, with colors stolen from a different digital photograph 
of nature. The pictures are incomplete: they were computed using a version of the 
Chaos Game algorithm, but the associated invariant measure has very little mass 
in the corners of the square. In the image at top right, this lack of mass is reprised 
by the central four-leaf structure, and thence repeated throughout the image. All 
four of these abstract images are informed by fractal geometry, including measure 
theory aspects, as well as the geometry of nature. 


Plate 50 


A picture of a fractal top (see also Plate 46): this represents the top of the 
attractor of an IFS with condensation set: the condensation set is a picture 
of a leaf; the attractor is the unit square; the IFS consists of four bi-affine 
functions. By looking at this image for a few minutes you should be able to 
figure out how the four maps act on the whole square in a bi-affine manner. 
But note how the original leaf dominates or “lies on top of” its children, 
who lie on top of their children, and so on.... The colors in this picture are 
arbitrary and synthetic: they were not stolen from nature. 


Plate 51 


First example, in this sequence of plates, of a fractal homeomorphism As 
we go though these plates we are transitioning from basic IFS theory to 
transformations of real world images using fractal geometry. What you are 
actually looking at here is homeomorphic to a picture of a crossword puzzle. 
In this example, and others that follow, two carefully chosen just-touching 
bi-affine IFSs are used, each of which has the filled unit square as its attractor. 
The homeomorphism f from the square to itself is constructed by mapping 
the point x to the point f(x) that has the same set of addresses (relative to the 
second IFS) as the point x (relative to the first IFS.) Conditions under which 
such transformations are indeed homeomorphisms, even when the two IFSs 
are overlapping, is a subject of current research. See www.superfractals.com. 


Plate 52 


I laid oranges and lemons and some leaves from the lemon tree (see Plate 57) 
in a rectangular array on the ground. I separated some of the rows of fruit 
by black electrical tape, and photographed the result. I then applied, to the 
digital photo, the same fractal homeomorphism as the one used to produce 
Plate 51. In each case the attractor, the unit square, is the same, and the way the 
transformation warps the interior of the square is the same: but the resulting 
pictures are different because one can see only how boundaries of regions of 
nearly constant color are deformed. Note that a fractal homeomorphism may 
change the fractal dimension of a set upon which it acts. 


Plate 53 


A similar fractal homeomorphism to the one in Plates 51 and 52 has here been 
applied to a picture of fallen autumn leaves. I was astonished at the coloring, 
so vibrant, in the transformed picture, and had to look again and again at 
the original to convince myself that there was not an error in the computer 
implementation. This suggests that our familiarity with pictures of nature, of 
clouds and foliage, of mountain vistas and fields of flowers, leads us to not be 
able to see such things very well; we become easily habituated to the category 
of the beautiful nature that we are looking at and so, almost, become a tiny 
bit bored, or economical in our studying of the detail, replacing what we see 
with a cheap, redundant, mental model: fractal homeomorphisms may allow 
us to see nature again, as though for the first time. 


Plate 54 


Fractal transformation of seaweeds on a beach in Western Australia. The 
fractal transformation here is quite different from the ones used in Plates 
51, 52, and 53, but again the underlying attractor is the unit square and 
bi-affine IFSs are used. The stretching and shrinking is more extreme, and 
a clear pattern of quadrilaterals can be observed. Certain expanded features 
correspond to very few pixels in the original photograph, while other regions 
are greatly compressed. Note the patterns in the corners: can you deduce the 
bi-affine just-touching collage? 


Plate 55 


This picture illustrates a continuous fractal transformation that is not a 
homeomorphism. The green-grey-brown part of this picture is supported on 
the attractor of a bi-affine IFS of six transformations. Turned on its side, 
this attactor comprises the disjoint union of a non-denumerable collection 
of graphs of fractal interpolation functions: any horizontal line through the 
unit square intersects the attractor in a Cantor set. The fractal dimensions 
of these graphs are varied, some equal to one and others greater than one. 
A continuous transformation from this attractor onto the unit square is 
constructed in much the same way as the fractal homeomorphisms discussed 
in the caption to Plate 51. By composing such a fractal transformation with 
a picture supported on the unit square, we obtain a picture supported on the 
attractor. In this case, a picture of a tall wild grass plant has been transformed; 
now we see the colours from the original picture laid out in strands, no longer 
lost in the familiar shape of a tussock of grass. 


Plate 56 


This illustrates a photograph of flowers in a conservatory, treated as a 
condensation set, being drawn into the attractor of an IFS of two contractive 
similitudes. But the copies of the condensation set have been put down in a 
particular ordering, one on top of another, analogously to the situation in 
Plate 50. Note the interplay of the crimson spiral of flowers with the leafy 


fractal boundary. 


Plate 57 


This picture contrasts a fractal homeomorphism with a fractal transformation 
that is merely continuous almost everywhere. The top picture is a digital 
photograph of a lemon tree and wallflowers. The middle picture is a bi-affine 
fractal homeomorphism of the top picture. The bottom picture is part of 
a fractal transformation of the original image, using a twin-dragon fractal, 
using a fractal top to define the mapping between points on the unit square 
and points on the twin-dragon. 


Plate 58 


Before and after fractal homeomorphism of trees, rocks, grasses, plant pots, 
sleepers, and decking. The transformed picture helps me to see the original 
picture with fresh eyes; when I was young the world seemed magical and 
strange — I get that feeling now when I visit new landscapes, but it fades 
with time; each new fractal transformation makes me regain that sense of 
strangeness that should, I think, always be there. 


Plate 59 


The two artworks reproduced here hang on the walls of my home. The top 
picture attempts to merge the figure with the landscape, sandunes and grasses, 
via fractal homeomorphism and measure theory. The original is printed at 
very high resolution on good quality paper. One of the great challenges of 
art is to capture the nude form in a manner that repositions it in space and 
time, to fill us with wonder about ourselves. The bottom image comprises two 
fractal Nerieds flowing in ripples formed by fractal homeomorphisms built 
from fractal interpolation functions. I also have an animation of this picture; 
they waver and vary in a watery way as the homeomorphism flows over them. 


Plate 60 


Fractal transformations can be used to break up a picture into a fractal jigsaw 
of pieces that can then be put back together with black seams between the 
pieces: the resulting “stained-glass effect” pictures, here of my daughter Rose 
and my wife Louisa, have been set in a picture of real glass. What you will 
notice, on looking closely, is that the transformed pictures are complete: you 
could in principle reconstruct the original photos from them by inverting the 
fractal transformations. Nature looks in, fractally transformed by the optical 
distortions produced by the glass, in the middle panel. 


Plate 61 


This picture was made by Louisa Barnsley using a fractal camera. Each 
component image was obtained in the obvious manner using a triangular 
picture as condensation set for a Sierpinski triangle IFS. In each case, the 
original picture and the fractal set seem to interact to produce something 
more than either individually. 
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A simple machine for modelling some Figure X.275. Imag- 
binary subsets of CJ. ine that you have control 
knobs so that you can ad- 
just the affine coefficients. 
w;(%)\ me (2 2 a J 
y cj ai} \y fj 


Another simple point-set fractal system is suggested by the diagram in Figure 
X.276. This type of machine generates recurrent fractal interpolation functions. The 
coefficients of the transformations are constrained to make sure that the resulting 
attractors are graphs of continuous functions, as follows: 


w1(X0, Yo) = (Xo, Yo); w3(X2; Y2) = (%3, y3), W5(X2, Yo) = (x4, ya) 
W1(X2, y2) = (%1, yi), w3(X0, Yo) = (%2, y2), W5(X4, Y4) = (X5, Ys) 
w2(x2, y2) = (X41, yi), wa4(x4, v4) = (x3, y3), W6(X4, y4) = (Xs, ys) 
w2(x4, y4) = (x2, ya), wa(X6, Yo) = (%4, ya), W6(X6, Yo) = (%6, Y6)- 
One example of a commercial fractal system, useful for educational purposes and 
for desktop publishing applications, is the Desktop Fractal Design System, [DFDS].* 
This is based on simple recurrent IFS theory using affine transformations in two di- 
mensions. This can be used to illustrate some aspects of modelling using point-set 
attractors of recurrent IFS built up of affine transformations acting in two dimen- 
sions. 


+The Desktop Fractal Design System, Version 2.0, for MacIntosh and IBM PC. Published 
by Academic Press, 1992. 
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Figure X.276. A frac- 
tal system constrained 

so that its attractors are 
graphs of continuous func- 
tions. 


Consider the IFS for a fern, as introduced in Chapter III. We refer to this as Fern 
1. Its IFS code is given in Table X.4 and it is shown in Figure X.277. 

In Figure X.278 we illustrate how one can create the same picture of Fern 1 by 
using a duplicate copy of the space 1) with the same IFS acting on that space, and 
allowing a transformation to map from one copy of the space to the other. 

Look closely at the structure of Fern 1. Notice how the main fronds alternate as 
we go up the main stem. As we saw in Chapter III, this same structure is repeated 
at all scales. If we zoom in on the tiniest frond, we will find that all the frondlets 
alternate up their respective stems. 

Now suppose we modify our more complex recurrent system by adjusting the IFS 
for the first copy of the fern, that is we adjust the affine coefficients so that Fern 1 on 
the first copy of D becomes Fern 2, as illustrated in Figure X.279 and Table X.5. 

Notice that the fronds in Fern 2 are opposed instead of alternating. Finally, in 
Figure X.280, we show the effect of the modifications on the attractor as a whole. 
The original copy of the fern now has opposed fronds, some of which are made of 
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Figure X.277. A bi- 
nary fern image is the 
point-set attractor for a 
single fractal system con- 
sisting of an IFS of affine 
maps acting in R?. It is 
closely related to more 
complex fractals based on 
recurrent IFS theory. 
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Figure X.278. This 
shows how the attractor 
in Figure X.277 is also 
the attractor of a recurrent 
fractal system, involving 
two copies of [ instead 
of one. The attractor of 
the IFS appears as part of 
the attractor of a recurrent 
IFS. 
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Figure X.279. The IFS 
acting on the first copy of 
Cis modified to create a 
new fern, Fern 2. 
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Figure X.280. The 
result of the modification 
of the first copy of the fern 
is to fundamentally alter 
the structure of the part of 
the attractor appearing on 
the first copy of D. 
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Fundamentally 
modified fern. 
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Table X.4. IFS code for Fern 1. 


w a b Cc d e f 

1 002 0.359 0:35 7030 9 106 
2 0.68 0.17 -0.17 0.68 73 11 
3 0.05 -0.80 -0.22 -0.19 286 131 
4 0.02 -0.36 0.00 038 160 £97 


w a b Cc d e t 

1 -0.02 0.39 -0.31 030 64 116 
2 064 0.16 -0.16 064 #467 15 
3 0.00 -0.80 -0.22 0.01 237 147 
4 0.20 -0.33 0.00 035 144 96 


smaller ferns with alternating fronds. We are learning to control a recurrent fractal 
model! 

Finally what happens if we feed the first copy back into some of the fronds on the 
second copy? The results of such an adjustment are shown in Figure X.281. In fact, 
in this example, the bottom-most fronds on each fern are affine transformations of 
the other fern. 

Color Plate 36 shows a fern image, and zooms on it, produced by a fractal system. 
In this case the fractal system involves four copies of 0 and a recurrent structure of 
affine transformations of the type discussed in this section. In this section each of 
three successive zooms on one of the ferns reveals new structure. 

Another example of an attractor, consisting of four point-sets at once, in shown in 
Figure X.282. It is good to think of the vector of sets as a single entity. 


4 Fractal Systems with Vectors of Measures as Their Aftractors 


In this section we are concerned with the construction of fractal systems, using 
recurrent IFS, whose attractors are vectors of measures. Our goal is to show how the 
collage theorem and associated machinery can be set up to enable fractal modelling 
using the measure theoretic attractors of certain recurrent IFS. That is, recurrent IFS 
can be used to make fractal systems suitable for modelling measures. 

Recall what happens in the case of an IFS with probabilities: for example, sup- 
pose X =O C R?, {X; wy,..., wy; p1,--., pw} is an IFS with probabilities, as in 
section 6 in Chapter IX, with the w;’s contractive affine, and Y = P(X); then we 
can define a corresponding fractal system by taking O to be the associated Markov 
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Figure X.281. A frac- 
tal system, based on a 
recurrent IFS structure, 
produces two ferns whose 
structures are woven to- 
gether. Neither could exist 
without the other! 


operator as defined in section 6 in Chapter IX; that is O: Y — Y is defined by 
O(v) = M(v) = pivow,! + ppvow;! +--+ pyvowy!. 


Then we know from the theory of Chapter IX that O is contractive in the Hutchinson 
metric. 

What is the analagous structure in the case of recurrent IFS? We use the notation 
of the previous section, but restrict attention to the case of measures supported on 
three copies of X. Instead of using the measure space P(X), we work with the space 
of normalized Borel measures P that are defined on Borel subsets S of X x X x X 
of the special form 
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Figure X.282. Affine 
maps, linking four copies 
of (1, lead to a recurrent 
structure consisting of 
four linked sets. Adjust- 
ments of the coefficients 
lead to a wide variety of 
fractal modes in this frac- 
tal system. 


§ = (S;, S>, 83) € B(X) x B(X) x B(X). 


where B(X) denotes the Borel subsets of X. A point in P is described by a vector of 
measures 


pb = (f1, 12, 3), 


where each py; is a Borel measure on X, and 


[i (X) + f42(X) + w3(X) = 1. 


Then the measure of vector of subsets S = ($;, $2, $3) is 
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ju(S) = (Si) + 2(S2) + 13(S3). 


We define a generalized Hutchinson metric d on P by 


iG) = 00 | ([ sides - [ dn): F= Ui fe MEL} 
for all 2, v e P, where 
C= Bae (fi, fo, fz): fi: X @ R continuous, 


and | f(x) — f(y) < d(x, yy) Vx, y €X,i=1,2, 3}. 


Then, from the fact that (P(X), dy) is a compact metric space whenever (X, d) is a 
compact metric space, it follows that (P, d) is a compact metric space too. 
We are now ready to describe generalized Markov operators M, 


M:P—P, 
that are associated with recurrent IFS. We illustrate the idea for recurrent IFS such 


as the one in Table X.1, which is the subject of Figures X.268 and X.274. We simply 
define 


M (ji(S)) = (x pire (wy '(S1)). >> pi2te: (wy '(S2)). >> Dist (5's) 
i i i 
Our general expectation is that such operators are contractions on (P, d) when the 
transformations w; are contractions. 


We demonstrate that M has contractivity factor s when p;; = p;, and the w,’s are 
contractive with contractivity factor 0 < s < 1. We will need the identity 


[ 4 (11(8);) = 2 Pi [ fj (wj(x)) dv;{(x), 
which follows from a change of variables. Then 
d (16), M@)) 
= sup I» ( [ (f;d(M (ji) ;)) — ? fjd(M (0) ») :fel, 
j 


= sup | [ Smee me [ Xr ticwyeon ary Pf eL, 
if rF 


=a | [ Drvfientaran.— [Desfintoran: Fer} 
7 rf 


Let, for x € X, 
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i@y= (- S- pi fi(wj(a)), 8" D> pj fi(wj(x)), 87 E pisses : 
J j J 


Then f € L. Let 
L= {f EL: 7 is of the special form of i above, for some f € L}. 


Then we can write 
Aci), ina) = su |s f Fans f Fas: Fei}. 
Since L C L, it follows that 
d(M(o), M(jt)) < sd(®, i). 
This completes the demonstration of the contractivity of the recurrent “multilevel” 
Markov operator, in the special case p;; = pj. 

However, in the general case where the transition matrix P = (p;;) is not con- 
strained as above, the contractivity of the Markov operator is controlled not only by 
the contractivities of the w;’s but also by the behavior of the transition matrix. To 
understand this, consider what happens when the space X consists of a single point. 
Then P becomes the set 

{it = (1, M2, M3) € Rs wi + 2 + 3 = 1, “1 = 0, wa =O, 3 > O}. 


That is, a typical measure pz € P is described by a vector of length three and total 
mass one. The Hutchinson distance between two such vectors is simply the sum of 
the absolute values of the differences between the components as follows: 


d (jt = (M1, Ma, M3), D = (V1, V2, V3) = |i — vil + [2 — val + les — val. 
Then the Markov operator M : P > P is given by 


M (jt) = (141, M2, M3) P = (ui pi + M2pa1 + M3psi, Mipi2 + M2P2 + M3P32, 
L1Pi3 + 2P23 + 43P33), 


where 
Pit Pit Ps 
P=| pan p22 px 
P31 P32 P33 
It follows that 


d(M(), M@) < dG, i). 
However we cannot, in the general case, assert a contractivity factor smaller than 
one, as can be seen by considering the considering the case 


1 — 2e€ € € 
Ps € ] — 2€ € f 
€ € 1 — 2e€ 
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where € is very small and positive. For this transition matrix, we find that 
d (1((1,0, 0)), M(O, 0, 1))) = (1 = 3e)d ((1, 0,0), ©, 0, 1). 


However, so long as the Markov operator has a unique fixed point, there always 
exist metrics such that the operator is contractive, in fact, with an arbitrarily small 
contractivity factor. The problem is the identification of such a metric, one that is 
simultaneously intuitively or visually accessible. 

The operator M : P(X) — P(X), associated with an IFS with probabilities, 


M(v) = piyvow;' + povow;' +---+ pyvow,y, 


is an example of a measure preserving transformation on a metric space, constructed 
from contractive transformations. 

Given a set of spaces, {X,:k =1,2,..., N}, we can construct measure preserv- 
ing transformations M;; :P(X;) — P(X;), according to 

ei aif at 
Mij(v) = pijiv ° Wij, + Pij2V © Wijy + +++ + PijnjY O Wy,» 
where wij, : X; — Xj; for n € {1,2,..., Nij} is contractive between the metric 
spaces (X;, d;) and (X;, d;) according to 
d; (wijn(x), wijn(y)) <8 -dj(x, y) forall x, y EX, 


and the pj;;,’s being positive numbers such that 
Do Pin = 1. 
n 


(One can think of {X;, Xj; wijn, Dijnn = 1, 2,..., Nij} as being an “IFS” that acts 
between the spaces X; and X;.) We now introduce an N x N transition matrix 
P = (p;;), that is row stochastic, and gives the probability of transiti n from space 
X; to X;. Then we can define a Markov operator M:P —>P, where P = P(X) x 
P(X2) x +++ x P(Xw), by 


M (1, May +++) hn) = (x PiMin(ui), )» Pi2Mia(ui),..-. > piv) 7 


In general we expect such operators to be contractive, and to provide the basis for 
useful fractal systems for modelling vectors of measures, and in particular, grayscale 
photographs. An example of such a system is VRIFS™,° which is aninteractive image 


modelling system that uses a fractal system built from operators of the type of M 


> VRIFS™, which stands for Vector Recurrent Iterated Function System, is an interactive 
measure theoretic fractal system for modelling images in two dimensions. It runs on Sun 
workstations and uses four-dimensional vectors of measures under affine control. It is avail- 
able from Iterated Systems Inc., Commercial Division, 5550-A Peachtree Parkway, Norcross, 
Georgia, 30092, USA, telephone number (404) 840 0633. 
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Figure X.283. A spe- 
cial Markov operator acts 
on a vector of measures. 
Its attractor is a stack of 
images, including a split 
root. Fractal systems of 
this type are good for frac- 
tal modelling of grayscale 
images. 


above, with all transformations being two dimensional affine transformations. An 
example of an image generated by VRIFS™is shown in Figure X.283.See also Color 
Plates 37, 38, and 39. 
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Selected Answers 


Answers to Chapter Il, section 1] 


II.1.8 The Riemann sphere is not a vector space because of the action of the ‘Point at Infinity’. To be a vector space, each 
point must have an additive inverse, that is, another point which summed to it yields the additive identity element 0. Since 
there is no such inverse for oo, the space is not a vector space. Note that the usual check a(x + y)=ax + ay does hold. 
The examples in 1.7 are not vector spaces for the simple reason that they are not closed under algebraic operations. 
That is, multiply any element in X by a large enough scalar, and it will not be in the space. The example in 1.6 has the 
same problem. If one defines multiplication by scalars in codespace in a reasonable way, namely multiply the distances by 
multiplying the codes and ‘carrying’ one will find a sequence for which there are no codes. (Multiply x = (N — 1)(N — 
1)--- by 2). 
11.1.9 By definition {x} C X if every element in {x} is in X. The only element in {x} is x which by assumption is in X. 
Hence {x} C X. 


Answers to Chapter Il, section 2 


II.2.1 We use extensively the inequality |x| + |y| = |x + y|. It is proven by separating it into three cases: a) x, y > 0. In this 
case |x| = x, |y| = y, and since the sum of two positive numbers is positive,|x + y| =x + y = |x| + |y]. b) x, y > 0. In this 
case |x| = —x, |y| = —y. Since the sum of two negative numbers is negative, |x + y| = —x — y= |x|+|y|.c)x >0, y <0. 
By changing the roles of x, y this proves the other possibility as well. |x| =x, |y| = —y. |x + y| = |x — (—y)| = [|x| —lyll. 
This latter number is either equal to |x| — |y| or its negative, and both are less than or equal to |x| + |y| with equality only 
if one or the other are zero. Hence |x + y| < |x| + |y|. We proceed to verify the metric space hypotheses: 


(a) 


(i) d(x, y) =|x - y|=|-G@—y)l=ly —x| =d(y, x). 

(ii) x — y £O unless x = y. It is not infinite if x, y are not infinite. Since taking the absolute value does not alter either 
of these properties 0 < d(x, y) < 0. 

(111) d(x, x) = |x — x| = [0| =0. 

(iv) d(x, y)=|x — y|=|x —z+z-—y|< |x — z| + |z — y| by means of the proof above. 


(b) (i), (ii), (iit) can be verified by multiplying everything in part a) by 2. (iv) d(x, y) = 2|x — y| =2|x —z+z-y|< 
2(\x — z| +|z — y| =2|x — z]+2|z— yI. 

(c) All properties can be done by a) substituting x°, y°, z* everywhere in the above arguments. 
11.2.3 d(x, y) = |xy| is not a metric for R. Choose x = 0 and y £0. Then x $ y, but d(x, y) = |0- y| =0, which violates 
property (i). 
11.2.4 On the punctured plane R? \ O define d(x, y) = |r; — r2| + 6. Then for two points x, y, we have 


(1) d(x, y) = |r) —r2 + O| = |r. — | + 9 = dy, x) since 6 is still the smallest angle subtended by the two rays. 
(2) Ifx # y and |r; — r2| =O then 9 £0 sod(x, y) > Oif x # y. 9 < 7 so that for d(x, y) = oo we must have either r, 
or rz infinite. But r, = xf + x} and r2 = y; + y3 so if either is infinite so is either x or y. Hence 0 < d(x, y) < oo if 


XY. 


416 


Selected Answers 417 


(3) d(x,x)=|r—r|+0=0+0=0. 

(4) d(x,z)+d(z, y) =|r) —7r3| + Irs — ro] + 01 + 02. 01 + 2 > O if the ray to z is not between the other two, and 
6, + 6, =9 otherwise, so d(x, z) + d(z, y) = |n — 73| + Ir3 — re] +8 = Inn — 73 +73 -— | +9 =I|In-—nrl+O= 
d(x, y) by problem 1 above. 


11.2.5 If x, y are not equal, then there is an x;, y; pair which do not agree. Hence |x; — y;|/(N + 1)‘ #0. Since all the 
terms in the sum are positive or zero, there is no cancelling, and therefore d(x, y) 4 0. We can switch the order in each term 
because of the absolute value signs, consequently d(x, y) = d(y, x). The series converges since the numerators are less than 
2N so the series is dominated by 


el 
ND WET 


which converges for all N < 00. Because the series converges, the proof of all the metric requirements, except the above, 
follow from problem on each term, and therefore for the whole series. 


11.2.6 The easiest way to show that this equation for a distance measure satisfies metric properties turns out to be that we 
show that it really is a Euclidean metric, in the usual sense. Let x be an element of & the code space on N symbols. Then 
the series 


2 way 


converges to a number in [0, 1]. In fact it is the (N + 1)-ary expansion of a real number in this range, having the property 


that it has no digits equal to N, because the code space symbols range from 0,..., N — 1. For instance, when N = 9 we 
have 
y Soe care ts = Xj X2X 
A. 10! TF 100 = X14 X2X3... 


Then the mapping & — [0, 1] given by 
= 3 0 ae 
i=] (N = 1)! 


is a metric equivalence, with constants c;, c2 in the definition equal to 1. Using this map, let (x, y), (u, v) be two points 
in the space of shifts on N symbols. By the above equivalence, we have corresponding points (x, y), (u, Vv) in the square 
[0, 1] x [0, 1] C R?. Since the series converge, we have 

oo 


- =< Xj a (x; — ui) 
=). weir Dw T ae N + 1) 


The metric in the statement of the problem can therefore be written as 


d((x, y), U, v)) = JV @ — u)? + (9 — v/ 


which is Euclidean distance in R*, and hence a metric. 

11.2.7 In body space, the shortest distance between two points is the length of the path through the body. The shortest 
distance from a point to itself is then 0. The shortest distance between two points is the length of the path which is also the 
shortest path in reverse order. The length of a path between two distinct points cannot be zero, and a path of infinite length 
inside the body must not be the shortest path between two points. Finally the shortest path between two points in the body 
is less than the path which takes a detour to a third point unless that point is on the shortest path already. Hence d(x, y) is a 
metric. 

11.2.8 For a function d which is not a metric, see for example exercise 2.3. For the annulus to look like a cylinder, we must 
have the distance around any concentric path look the same. This implies that we need the distance around all concentric 
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paths equal. Use the angle subtended between two points to do this. To look like height, we need a linear radial distance, use 
the radius. If we want to look like the surface of a cylinder with Euclidean metric (unrolled), we thus use ,/(r,; — r2)? + 67. 
The proof that this is a metric follows exercise 2.4. 

11.2.9 That a distance measure formed by great circle distance is a metric is true on any two-sphere. We verify the conditions 
as follows: 

(i) The plane defined by the center C of the sphere and any two distinct points x, y on its surface cuts the sphere on the 
great circle path between the two points (that is its definition), so there is one and only one great circle distance between the 
two points. This distance is r6 where 6 is the interior angle xC y. Hence d(x, y) = 0 if and only if 6 = 0 that is x = y. 

(ii) Because we are dealing with interior angles, the angle xC y and the angle yCx are equal hence d(x, y) =d(y, x) > 0. 
The maximum value of an interior angle is 7, hence d(x, y) <r < oo. 

(iii) To prove the triangle inequality, we note first that if x, y,z lie on the same great circle, then if z is interior to 
x,y, d(x,z)+d(z, y) =d(x, y), whereas if it is exterior then since an exterior angle is larger than an interior one, 
d(x,z) + d(z, y) > d(x, y). Consequently, we deal only with z not on the great circle defined by x, y. For this case, the 
RR? distance between the points x and z is given by /r? + r3 — 2rir. cos 6 by the law of cosines. Without loss of generality, 
r; =r. = 1 and the square root is monotonic increasing, so that this distance is proportional to 1 — cos 6 and hence decreases 
for decreasing 9. Consider the plane triangle in R° defined by x, y,z. By moving z towards the line Xy, we decrease 
the distances between x and z and between z and y (we are moving down the perpendicular from z). Since the above 
law of cosines formula insists that these distances can only decrease if the corresponding angles xCz and yCz decrease, 
the distances on the sphere are also strictly decreasing. This implies that d(x, z) + d(y, z) > d(x, y) in this case. Hence 
the triangle inequality holds for great circle distance on any sphere. When this sphere is interpreted as C, the great circle 
between oo and any other point is along the projection of vertical lines in the complex plane. Hence since 1 + i is above the 
real axis, its distance to infinity is less than that of zero, which is 7/2. 


II.2.10 Suppose that we have 
c1d,(x, y) < do(x, y) < cod (x, y) 
where cj), C2 are positive real numbers. Using the first inequality, we have 
1 
d(x, y) = ee y) 
1 
and the second gives 
1 
—d,(x, y) < d(x, y) 
C2 
Let e, = ~ and e; = ar and combine the above two inequalities, we get 
€:d2(x, y) < d(x, y) < é2d2(x, y) 


as desired. 

II.2.11 Yes in both cases. Let d,(x, y) be Manhattan metric,d,(x, y) be Euclidean, and use, for instance c, = 1/2, cp =2. 
Since the Manhattan metric is at most /2 times the Euclidean metric (when they lie on a line at a 45° angle) these constants 
will do fine. 

1.2.12 Two metrics are uniformly equivalent if and only if there are constants c;, c2 such that 


cyd\(x, y) < do(x, y) < cod) (x, y) 


for every pair x, y in the space. To see that the metric d(x, y) = |r; — r2| + @ is not equivalent to the Euclidean metric on 
the annulus, choose d, to be the Euclidean metric, and d, to be this one. Let x, y be on the real line x < 0 and y > 0. Then 
d,(x, y) > 6 =-7. Regardless of the value of c. by moving x, y towards the origin we can insure that d;(x, y) is less than 
any € by letting x = (—é/3, 0) and y = (€/3, 0). Hence c2d)(x, y) < cn. Then if € < c2/m the inequality fails. Hence the 
two metrics are not equivalent. 
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11.2.13 Use h(x) =x — 1. Then d;(h(x), h(y)) =2-|(x — 1), (y — 1)| =2- |x — y|. This metric is equivalent to d;(x, y) 
by using c) = 1/4,c,.=1. 

II.2.14 To show that (@,Manhattan) and (@,Euclidean) are equivalent metric spaces, use h(x) = x. It is easiest to see 
the equivalence by making the substitution x’ = 0, y’ = y — x = y; — x1, y2 — X2. Then the Euclidean distance d(x, y) = 


d(O, y)= y+ y,” =r. The Manhattan distance in the same notation is |r cos @ + r sin 6|. Maximizing the last expres- 


sion with respect to 6 yields 9 = 2/2 or cos@ = sin@ = /2/2. So the maximum difference in size between the two metrics 
is dy = /2dz. Then 


l 
—dy (x, y) < dg(x, y) < dy (x, y). 


/2 


11.2.15 Using h(z) = A(z; + iz2) = (zi, Z2), we have d(x, y) =dg(x, y) on R?. The above problem gives the metric equiv- 
alence for this with the Manhattan metric. 


11.2.16 Suppose they were equivalent. Then there are constants c;, c2 such that 
c1d,(x, y) < d(x, y) < Codi (x, y) 
Let x = 1 and y= nara Then d2(x, y) = 2c2d;(x, y), a contradiction. Hence they are not equivalent. 
11.2.18 If two metric spaces are equivalent, there is a one-to-one, onto map h : X; — X> such that 
1d, (x, y) < d)(h(x), h(y)) < c2di (x, y) 
The second inequality implies that if 
d,(x, y) < 6 =€/c2 then d,(h(x), h(y)) < 26 =€. 


Since h is invertible, we may write the first inequality as 
1 
d\(h7'(x), h-"(y)) < 5,2 9) 


for points x, y € X2. Then d;(h~!(x), h7!(y)) < € whenever d>(x, y) < 6 = c\e. Hence h and its inverse are continuous, so 
h(x) is a homeomorphism. 

11.2.19 Intuitively, these metrics behave similar to base 10 decimals. In fact, if we look at X19 the symbols are 0-9, and by 
choosing k = N = 10, we get decimal expressions (with some ambiguity). To show that these metrics are topologically 
identical, we must demonstrate that the identity map .(x) = x is a homeomorphism. This is equivalent in epsilon-delta 
format to showing that we need to show that for every x, € > 0 there is a 6 such that for every y in Ly the two statements 


dy, (x, y) < 5 => d,, (x, y) <€ 
and, 
dy, (x, y) <4 => dy, (x, y) <€ 


are true. 

We can simplify matters by breaking the proof into cases and reducing it to a single statement to prove as follows: 

Case 1: k, = ky. In this case there is nothing to prove, since a metric is always equivalent to itself. 

Case 2: k; < ky. In this case, by comparing terms of the two series, since ki, > ki, de (x, y) < dk, (x, y) for any x, y with 
equality holding only for x = y. Consequently, we may choose 6 = ¢ in the first statement: 


dy, (x, y) <€> dy, (x, y) = dy, Es 
and we need only prove the second statement, namely 


dy, (Xx, y) <6 => dy, (x, y) <€ 
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Case 3: This case requires only the proof of the first statement for the same reasons as case 2. There is no loss in just 
assuming case 2, and reversing the subscripts for case 3. 
Consequently, all we need to do is show that if k, < k», 


dy, (x, y) < 6, => dy, (x, y) = 


and then take 6 = min(4;, €). 

Looking at the analogy between decimals and yw, we can get the distance d,(x, y) arbitrarily small if x; = y; up to some 
big number ip. This would correspond in decimal arithmetic to the first ig decimal places being equal to zero in the distance 
formula. Writing it without the analogy, since the series is less than 


then 


[feb 


To insure this is less than, say a we only need to choose i large enough. This is nice, we can get a set we can describe by 
just using the digits that put us inside a given B,.(x). What we don’t know is whether it contains a ball inside it (For large 
enough N it is a ball, but we don’t know that). On the other hand, we can look at the smallest possible distance we could get 
if any of the first ip “digits” were not zero. This would be the distance between an x and a y with only the ip digits different, 
namely 


[Xig — Yio | =1 


and x; = y; whenever i $ io. This distance is 
1 
a(x, y)= ts 


So any ball Bs(x) with 5 < 1/k'0 is guaranteed to have no elements which disagree on the first ip places. 
We now proceed as follows: Choose ip such that the set 


B={y:x, =; if i < io} 


is inside of the d,, ball B.(x). Now choose 5p such that the d,, ball Bs,(x). Since all elements of this d,, ball have at least ig 
“digits” of agreement with x, everything in the ball is in the d,, ball of radius €. Therefore, 


dy, (x, y) < 59 => d(x, y) <€ 


which was all we decided we needed to prove. 


Answers to Chapter Il, section 3 


11.3.1 By Definition 3.3, if {x,} is a Cauchy sequence in (X, d) a complete metric space, then it converges to a point x € X. 
This implies that given an € > 0 there is an N such that ifn > N then x, € B(x, €). Since there are infinitely many integers 
greater than any integer N, x, € B(x, €) for infinitely many integers n. 

II.3.2 We will use the property of (IR,Euclidean) that every set bounded above has a Least Upper Bound (LUB). This 
implies that a monotone increasing sequence {/,,} which is bounded above has a limit. Since the LUB cannot be less than 
1, for any n (the sequence is increasing) and there is no minimum distance to all the J, (since it is the LUB) it follows that 
for any € > 0 there is an N such that x, ¢ B(LUB, €) forn > N. We use this property to prove completeness of (R, d) as 
follows: Let €; = 1/2' and N; be for each i a number such that N; > N;_; and d(x,, Xm) < €; Whenever n,m > N;. It follows 
that there is an interval of length ¢€; such that all the x, are in this interval if n > N;. Let 1; be the greatest lower bound 
of the set {x, :n > N;} (which exists by subtracting everything from-a large number and using the LUB principle). The 
sequence {/;} is then monotone increasing, and is bounded above by the LUB of the same set of x, which we call L;. Then 


Selected Answers 421 


this sequence has a limit which we will call x. We claim that x is the limit of the Cauchy sequence as well. Given € > 0, 
B(x, €) D [l;, L;] for some i, hence if n > N;, x, € B(x, €) which means x is the limit of {x,,} and the sequence converges. 
Hence (R, Euclidean) is complete. 


11.3.3 We show that R* is complete by using the fact that R is complete. Let {(x,, y,} be a sequence in R? which converges 
to (x, y). Then {x,} > x, {y,} — y. The converse is also true in R”, since given € > 0 we can find a small rectangle 
B(x, €,) x B(x, €y) C B((x, y), €). In turn, we can find N,, Ny such that x, € B(x, €,) ifn > N, and y, € B(y, €y) if 
n > Ny. Choose N = max(N,, N,). Then for n > N, (xn, yn) € B((X, y), €) since it is in the small rectangle inside. With 


this in mind, let {(x,, y,)} be a Cauchy sequence. Then because ,/ (x, — Xm)? + (Yn — Ym)? < € implies |x, — Xm| < €, |¥n — 
Ym| < €, {X,} and {y,} are each Cauchy sequences in R. By problem 2, these converge, so {(x,, y,)} converges. Hence R? is 
complete. 


11.3.4 There are two ways of approaching this. We can either repeat the arguments in the answers to exercises 3.2 and 3.3 
for M, or we can peek ahead to the definition of a closed set. If (X, d) is a complete metric space, the only way a Cauchy 
sequence in a subset can fail to converge in the subset is if it converges to a limit which is not in that subset, because viewed 
as a Cauchy sequence in X, it has a limit. The limit of this sequence is thus a limit point of the subset, and hence if the set 
is closed, it is in the set. So any closed subset of a complete metric space is complete. @ is a closed subset of R? so it is 
complete by problem 3. 


11.3.5 The sphere can be thought of as an embedded closed subset of R*. The spherical metric is equivalent to the R° 
Euclidean metric. To see this, note that the Euclidean metric, used on the sphere gives to each pair of points x, y the length 
of the chord between them. This length, for a sphere of radius 1 is ./2 — 2.cos6 (by the law of cosines). Using the constants 
2/m and 1/7 this is a metric equivalence to the distance measure of great circles. If {x;} is a Cauchy sequence on the sphere, 
it is in R? as well by this equivalence, consequently it has some (R°*) limit x. Since the sphere is a closed subset of R°, it 
contains its limit points, and therefore contains x. Hence the sequence has a limit, and the sphere is complete. 


11.3.6 Let {x,} be a Cauchy sequence on the codespace © on N symbols. Then given € > 0 there is an M such that if 
1 


01, 02 > M then d(x,,, Xo,) < €. In particular, if we choose € = CA then since any of the first i terms must either be 0 or 
contribute more than this value, x,,, X., must agree on the first i terms. Choose x such that for each i, the first i terms of x 
agree with all those elements of {x,} with o > M;, where M; is the M corresponding to the above choice of €. Then x € & 
and for each € > 0 there is an M such that d(x, x,) < € if o > M. In other words x is the limit of {x,}. Hence codespace is 
complete. 

11.3.7 Let { f;} be a Cauchy sequence on (C[0, 1], D). Then for each s € [0, 1], the sequence { f;(s)} is Cauchy in R, since 
given € and N guaranteed by the assumptions, d(f;(s), ;(s)) in the Euclidean metric is certainly less than the maximum 
val 1e over s. We will anticipate a little and call the limit of each such sequence for each s f(s). As defined, f(s) is clearly 
a finction on [0, 1], we must show that it is continuous. Given € > 0 there is a 6 > O such that if d(s,, s2) < 6 then for any 
i, D(fi(si), fi(s2)) < €/3. This property, called equicontinuity, is true for bounded functions on a closed interval, such as 
we have. Then there is also an N such that if i > N d(f;(s,), f(s1)) < €/3 and d(f;(s2), f (s2)) < €/3 since the sequence 


coriverges. Consequently, for our choice of 4, 


d(f (81), f (s2)) < d(fi(s1), f(s1)) + d(fils1), fi(S2)) + d(fils2), F(S2)) < € 


by the triangle inequality, and hence f(s) is continuous. Then f € (C[0, 1], D) and the space is complete. 

11.3.8 Let (X,,d;) and (X2, d)) be equivalent metric spaces. Then there is a homeomorphism h : X, — X, such that 
d(::, y) = d,(h(x), h(y)) is equivalent to d(x, y) on Xj. Let {x,} be a Cauchy sequence in X2, and let € > 0, and N large 
enough that d2(xn, Xm) < € whenever n,m > N. There exists a sequence {y,} in X; such that h(y,) =x,Vn, and by the 
metric equivalence, we have that 


di (Yns Ym) < C24(Yny Ym) = Codn(Xns Xm) < C2€ 


Consequently, by specifying 5 = c,€, for any € > 0 there is an N such that d2(X%,,%m) < 6 which forces d\(Yn, Ym) < 
€V1,m > N, in other words {y,} is a Cauchy sequence in X;. This has a limit y because X, is complete. Since for all 
5 >- O there is an N such that d,(y, y,) < 6 forn > N, then since (d)(y, yn) < (1/c1)di(y, yn) < (1/c1)6 = €, it follows that 
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d (Y, Yn) = d(h(y), Xn), {Xn} > h(y) = x and the sequence converges in Xj. Hence X» is complete. In the process, if two 
spaces are metrically equivalent, then a Cauchy sequence in one corresponds to a Cauchy sequence in the other under the 
metric equivalence. 

11.3.9 Given two points x = (x), X2), y = (y1, y2) in (@,Manhattan), any path which consists of straight horizontal and 
vertical segments, which are monotonic in each coordinate have the same length. Since the sum of these lengths is equal 
to the distance between the two points, they are all shortest paths. 


II.3.10 Suppose {x,} — x in the space X. Then given €/2 > 0 there exists an N such that d(x, x,) < €/2 ifn > N. Choose 
two such points x,, X» then by the triangle inequality 


A (Xn: Am) = Oy 7 OG, Xe) SE 


So given €, ifn, m > N, d(Xn, Xm) < €, in other words, the sequence is Cauchy. 


II.3.11 Suppose {x,} — x and {x,} > y. Then given €/2 > 0 there is an N, such that d(x,, x) < €/2, and an N2 such that 
d(Xn, y) < €/2. Choose N = max(N,, N2). Then forn > N 


d(x; y) = dy xq) + dns y) <€ 


Consequently, for any € > 0, d(x, y) < €. In other words, d(x, y) =0, orx = y. 


I1.3.12 Problem is misstated. 0 is the limit regardless, in the second space the sequence does not converge because 0 is not 
in the space. 

II.3.13 Define the sequence {x,} by x, = aWn. This sequence is clearly Cauchy, d(xn, Xn) = 0 < € for any € > 0 and any 
n,m. It also converges since d(a, x,) =0 < € for all n and any € > 0. The space contains no limit points since a limit point 
is defined as being the limit of a sequence but not a member of the sequence. Since this is the only sequence in the space, 
the space contains no limit points and therefore it contains all its limit points (there are none) but is not perfect since a is not 
a limit point. 

11.3.14 {x, =n} does not converge in (R,Euclidean). Suppose it did. Then by Theorem 3.1 this is a Cauchy sequence. But 
for any n Am d(Xn, Xm) = 1 and thus is not less than any given € > 0 for any N. For (C, spherical), the distance to the 
Point at Infinity is calculated by looking at the tangent of the angle subtended by the real line and the projection through the 
sphere. The radius of the sphere is 1 so this tangent is 1/n. Given € > 0 we may choose N such that 1/N < e, and therefore 
ifn > N d(oo, x,) = 1/n < €. Hence the sequence converges in (C, spherical). 

11.3.15 Let x be a limit point in X,. Then there is a sequence {x,} — x,x #x,Vn, in X, by definition of a limit point. 
Since h provides a metric equivalence, given € > 0 there is a d > 0 such that ifn > N, d\(x, X,) <6 = (1/c)e implies that 
dy(h(x), h(Xn)) < €. Hence since h is onto {h(x,)} — h(x) € X>. Since h is 1-1 and onto, h(x,) # h(x)Vn. Hence h(x) is a 
limit point in X>. 

1I.3.16 (@,Euclidean) is defined as being those points whose x and y coordinates both lie in [0, 1]. The points in this set are 
all greater than 1 in the y coordinate, hence are not in @. The set, taken in R* has two limit points in @ (it does not converge 
as a sequence). These are at (1, 1) and (—1, 1). 

11.3.17 Any point of the set S is the limit of a sequence which converges because eventually all the terms are the same. This 
type of sequence is always possible (see exercise 3.13), however such convergences do not qualify the point as a limit point 
of the set. The only such point would be 0, since this is the only Cauchy sequence converging to point not in the sequence. 
Since this point is not in the space (0, 1], the set of limit points of S is empty and therefore a subset of 5. Hence S is closed 
in (0, 1],Euclidean). 

1.3.18 Let S = [0, 1] c (R,Euclidean). Let x € S, and let a = min(x, 1 — x). If a =0, set a = 1. In other words, a is the 
least distance to an end of the interval. Then the sequence {x, =x +a/2"} | lies entirely in S. Given € > 0 there is an 
N such that € > a/2%. This implies that forn > N, d(x, x,) =d(x,x — a/2") =a/2" <e, in other words this sequence in 
[O, 1] converges to x, hence x is a limit point. Since x was any point in the set, the set consists entirely of limit points. It is 
closed, and therefore perfect. 


II.3.19 By reference to exercise 3.17, this set contains its only limit point, 0. It is not perfect since for any n there is no 
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Cauchy sequence (and hence no convergent sequence converging to 1/n which does not contain 1/n. Hence 1/n is not a 
limit point, and the set is not perfect. It is closed, since it contains it’s limit point. 

11.3.20 That © contains its limit points is true because it is the whole space and is therefore closed. We need only verify 
that each point in & is the limit of a sequence {y,} which does not include it. This is done as follows: Let x = x,x2...€ & 
and suppose we have N symbols. For each n let y, be the point which agrees with x on all symbols except the n™, and is 
equal to the next greater symbol at the n™ place unless that symbol in x is N — 1 in which case use N — 2. Then for each n 
we have 


[o.@} 


d(x, Xn) =) ee ee Neg 
é (N+1)) (N+1)") (N+1)" 


because there is only one non-zero term in the sum. This sequence is then convergent-given € > 0 there is an M such that 
forn > M d(x, yn) =1/(N + 1)" <«. No term in the sequence is equal to x because it disagrees at the n"™ term. Since x is 
arbitrary, every point in © is a limit point and & is a perfect set. 

II.3.21 If (S, d) is complete, then given a Cauchy sequence in S, it converges to a point in S. Since every limit point is the 
limit of a convergent sequence in S, and hence a Cauchy sequence in S, S contains its limit points and is therefore closed. 
Suppose S is closed, and let {x,,} be a Cauchy sequence in S. Then {x,,} is a Cauchy sequence in X as well and there is a 
point x € X such that {x,} — x, because X is complete. Hence x is a limit point for S. Since S is closed, x € S. 
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11.4.1 In order for X to be closed, it must contain all its limit points. Thus let {x,} be a convergent sequence in X. By 
definition, there is an x € X such that {x,} — x. Hence X is closed. A Cauchy sequence need not converge however. Let 
X c Y, such that Y is complete. Then if {x,} > y ¢ X, then this sequence does not converge in X, and y is not a limit point 
in X. The sequence is nevertheless Cauchy, and since it does not converge, X is not a complete metric space. An example is 
m=), 1). 

11.4.2 See the answer to exercise 3.21 (Chapter II). 

11.4.3 By exercise II.3.15, if x is a limit point of the set S in X; then (x) is a limit point of the set 6(S) C X>. Suppose 
there is an element in 6(S) which is a limit point, namely that there exists a sequence {(9(s)),} > s € 0(S). Then given 
€ > O there is an N such that forn > N, 


d,(07'(0(s)n), 9 '(0(s))) < (1/c1)d2(O(5)n, O(8)) < € 
Letting 0—'(6(s),) = 5, we see that 6(s) is the image of a limit point in S. Since S is closed, the image of s is in 6(S), thus 
8(S) contains its limit points, and is therefore closed. 
11.4.4 Given x € X, by definition B(x, €) = {yinX : d(x, y) < €} and hence is a subset of X for any «. Hence X is open. 


11.4.6 Let S be a bounded subset of (R%,Euclidean). By definition, there is a point a € S and a number R, such that 
d(s,a) < RVs € S. Explicitly, 


N 
7 oe —a,)*<R 


n=1 


The terms in the sum are all non-negative, it therefore follows that (s, — Gn)? < R? or that |s, — a,| < R, for each entry. 
This means that S is contained in the box [a; — R, a, + R] x [a2 — R,a. + R] x --- x [ay — R, ay + R]. To provide a 
finite e-net for S, we subdivide each interval into M intervals such that N/M < . Then the product of the endpoints of 
each interval is a point in a finite €-net of this N-cube, its intersection with S is also finite and each point of S is less than 
€ away from one of the above corners. Hence S$ is totally bounded. To show that R* is complete, it suffices to note that R 
is complete (exercise II.3.2) and that a sequence is Cauchy or convergent if and only if it is Cauchy or convergent on each 
of its coordinates. Finally, since R” is complete, and S is totally bounded if it is bounded, it follows that if S is closed and 
bounded it is compact. 
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11.4.7 Let (X,d) be a metric space, f : X — X be continuous, and A be a compact non-empty subset of X. It follows 
immediately that if a € A, then f(a) € f(A) by definition, so f(A) is non-empty. Let {b,} be a sequence of points in 
f (A). Then each b, is the image of at least one point in A. Let {a,} be any sequence in A such that f(a,) = b, for each 
n. Because A is compact, there is a subsequence {a,} C {a,} which converges to some point a € A. The sequence { f(a,;)} 
is a subsequence of {b,,}, we must show that it converges to a point in f(A). In fact, because f is continuous, given € > 0, 
there is ad > 0 such that if d(a,,,a) < 6 then d( f(Gn,), f (@) < €. Since this subsequence converges in A, given 5 > 0 there 
is an N such thati > N implies d(a,,,a) < 6 and therefore that d(f(a,;), f(a)) <€. This subsequence therefore converges 
in f(A), and letting b,, = f (d,,;) we have found a convergent subsequence for the sequence {b,} C f(A). Hence f(A) is a 
compact non-empty subset of X. 


11.4.8 Let S Cc (X;,d,) be open. Then it follows that by example 2.4.5, X, \ S is closed. By exercise II.4.3, h(X, \ S) 
is closed in X>. Let t € h(X; \ S). Then it follows that since h is invertible, h~'(t) ¢ S, hence t € X, \ h(S). Hence 
h(X, \ S) = X2 \ h(S). Hence X, \ h(S) is closed which means that h(S) is open in X2. Note that the proof depends on 
the invertibility of h. It does not hold even for arbitrary continuous functions. 


1.4.10 Given x € (0, 1) we have 0 < x < 1 so choose € = min(x, 1 — x). Then the set B(x, €) = {y € X: d(x, y) < €} is 
a subset of (0, 1). Hence (0, 1) is open in X. Let x € {2}. This is a singleton set, so x = 2. Choose € < 1. Then the set 
B(x, €) ={y € X: d(x, y) < 1} contains only the point 2 and d(x, 2) =0 < €. Hence {2} is open in X. The two sets are 
both closed: (0, 1) = X \ {2} is the complement of an open set, by 2.4.5 it is closed. By the same token {2} = X \ (0, 1) so 
it is the complement of an open set and is closed. {2} is compact, since given any sequence in {2} all entries are 2, hence 
the sequence converges, and has a convergent subsequence (any subsequence). (0, 1) is not compact, choose the sequence 
{x, = 1/n}. This converges to 0 ¢ (0, 1), so any subsequence must likewise converge to 0. Hence it has no convergent 
subsequence. 


1.4.11 Let S Cc (X, d). By definition, its boundary 0S is the set of points s such that any open ball B(s, €) contains points 
in S and in X \ S. But S = X \ (X \ S), since it is the set of points which are in X which are not in Xsp \ S. Consequenctly 
if any open ball B(s, €) around a point s contains points in both sets it is in the boundary 0(X \ S, and the two boundaries 
are the same. X is open (exercise II.4.4) hence @ is closed, and contains it’s boundary. Since this implies the boundary of X 
is contained in the empty set, 0X = 9. 


11.4.12 Let (X;, d,) and (X>2, d.) be equivalent metric spaces and let h : X; —> X, be the metric equivalence. Let S C Xj, 
and let s € 0S. If h(s) ¢ dh(S) then there is an open ball B(h(s), €) which is either a subset of h(S) or of X, \ h(S). If 
it is a subset of h(S), then it is the image of an open set in S (see problem II.4.8), and by definition there is an open ball 
around s in S, which contradicts the statement s € 0S. If it is a subset of X2 \ A(S), then there are no points in B(h(s), €) 
which are the image of points in §. Hence B(h(s), €) is the image of an open set containing s which is contained in X, \ S. 
Then there is an open ball within this open set, which is centered at s, and contains no points in S. Again this contradicts 
s € 0S. Consequently, since this is true for all s in 0S, h(0S) C dh(S). If we do this again for the transformation h-'! with 
the boundary of h(S) we get that the inverse image of the boundary of h(S) is contained in 0S. Then the two boundaries are 
images of each other, and h(dS) = dh(S). 


11.4.13 S Cc (R,Euclidean) is the set of rational points in the real line. We want to show that the boundary of S is R itself. Let 
x € R, and let € > 0 be given. Suppose a, b are two integers such thatb=a+ 1 anda <x < b. Now let N be chosen such 
that 1/2" < €. Since the numbers a + k/2% for k < 2 subdivide the interval [a, b] into 2" equal parts of width 1/2" <«€ 
it follows that any interval which is greater than € in width contains at least one of them. But B(x, €) = [x — €, x + €] is of 
width 2e, hence there is a rational number in B(x, €). The number V2 is not a rational number: Let p/q = /2 and suppose 
the fraction p/q is in lowest terms. Squaring both sides yields p* = 2q” implying that p is even. Hence p? is divisible by 4. 
Hence q is even, a contradiction. If c is irrational, so is p/(cq) for any rational p/q. If not, by simple algebra we may write 
c in fractional form. Now take all the numbers in [a, b] of the form a + k/(./22”. Once again these are evenly spaced and 
any interval of width less than € contains at least one of them. Hence there is a number in B(x, €) which is in R \ S. Hence 
x € 0S. Since x was arbitrary, 0S = R. 


1.4.14 The only point in C for which every €-ball contains points in C and points in C \ C = {00} is oo. Hence the boundary 
of C in C is {00}. 
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1.4.15 Let S Cc X be closed. Suppose that there is a point in X such that for every « > 0 B(x, €) contains points in S and 
points in X \ S. Define a sequence {x,} by x, € B(x, 1/n), x, # x. If this cannot be done, then 0S = @ and the statement is 
true. Then the above sequence makes x a limit point of S and hence x € S. Hence 0S C S. 


1.4.16 Suppose S is open. Let s € dS. Then for every € > 0 there is a point x € B(s, €) such that x ¢ S. Then there is no 
€ > O such that B(s, €) C S. Since S is opens ¢ S. Hence DSN S=@. 


11.4.17 By definition S° = S \ 0S. Since SN AS = @, S \ AS = S. Hence S° = S. Conversely, suppose S° = S Then 0S = @. 
Consequently for any s € S there is an € > 0 for which B(s, €) contains no points in X \ §. Then B(s, €) C S, and S is open. 


II.4.18 Since S is closed, by exercise II.4.16, 9S C S. By definition S° = S \ 0S. We have S \ dS UdS=S. 


II.4.19 By exercise II.4.12, being the boundary of a set is invariant under metric equivalence. Thus if s € S°, then s € S 
and s ¢ 0S. Since h(s) € h(S), and h(s) € dh(S) = h(0S) implies that s € 0S which contradicts hypothesis, h(s) € (h(S) \ 
dh(S)) which means h(s) € (h(S))?°. 

11.4.20 X \ aS is S \ 8S U(X \ S) \ AS. Since 0S = 9(Xsp \ S) (by exercise II.4.11) X \ dS = (X \ S)° U S®, which are 
disjoint sets, and by II.4.17, are open. Note that one or the other or both may be empty, however. For the examples, (a) the 
boundary of the set S = {(x, y) € R?: x? + y? < 1} is the set 0S = {(x, y) € R?: x7 + y? = 1}, since every €-ball around 
a point in this set contains (x, y) € S and as well (x, y) such that x? + y? > 1. This is a circle of unit radius and divides 
the plane into the open disk interior, and the plane with a circular hole of radius one in it. (b) S = R* the whole space, so 
0S = @. The two sets R* and @ are both open as shown in II.4.1 and II.4.4. 


1.4.21 By problem II.4.20, X \ dS is the union of two disjoint open sets. The union of open sets is open, hence 0S is closed. 


II.4.22 We start this problem with an elementary fact about compact sets. Let C be a compact set. Then if A C C is a closed 
subset, let C,, be an open cover for A. We can extend this cover to an open cover of C, by adding €-balls in C \ A (which is 
open) we can form an open cover for C. There is therefore a finite subcover {C), ... Cy}. Since by our choice, all of the C,, 
which cover A are from the original open cover, and they are a subset of this finite subcover, there is a finite subcover for A. 
It follows that any closed subset of a compact set is compact. 

We now turn to the problem at hand. S is a subset of a compact metric space. By problem II.4.21, 0S is closed, and also 
a subset of the same compact metric space. Therefore 0S is compact. The assertion is best proven by proving the following 
statement which is true for arbitrary subsets S (closed, open, otherwise). Let S be a subset of a metric space (X, d). Then 
A C S is open in the space (S, d) if and only if we can write A = S/N O for some open subset of X. If A is open in (S, d) 
then for each a € A and each e > 0 


Bs(a,e)={y Ee S:d(a, y)<e}={yeX:d(a, y)<e}NS=Bx(a,e)nS 


so A= SMO where O is the union of such balls in X. Conversely, if A = SM O for some open set O € X then givena € A, 
there is an € > O such that Bx(a, €) C O, and 


Bs(a,e)={y € S:d(x,y)<es={yeX:d(x, y)<e}NS=Bx(a,e)nS 


The latter intersection is in A = $M O, so A is open. 

The assertion that the interior of the land is open in Z despite the fact that it includes points in the “border” of Z, is true 
by the above statement. The fact that it appears to include border points comes about because these points are points in 
dZ C X, whereas when Z is itself the metric space, the border of the whole space is the empty set. 
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11.5.1 The properties of connectedness preserved by metric equivalence: (i) Connectedness. Let (X,, d,) and (X2, d2) be 
equivalent metric spaces and h : X; —> Xz be the invertible function giving this equivalence. It was proven last section 
(problem II.4.8) that if S is an open set, then h(S) is open, and (problem II.4.3) if S is a closed set, then h(S) is closed. 
Because h=! also establishes an equivalence, the same is true about closed and open sets in X, under h~!. Let X; be 
connected. Suppose 5S C X2 is both closed and open. Then h~'(S) is closed because S is closed, and open because S is 
open. Since X; is connected, h~'(S) is either X; or @. In the first case, because h is onto, S = X. In the second S = @. 
Hence X, is connected. 
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(ii) By studying the definition of metric equivalence, we see that given € > 0 there is a 6 = (1/c2)e such that d,(h(x), 
h(y)) < € whenever d\(x, y) < 6. Consequently h is a continuous function. The composition of continuous functions is 
continuous. Suppose X; is pathwise connected. Given two points x, y € Xz because h is onto, there are a pair of points 
u, v € X; such that h(u) = x, h(v) = y. Because X, is pathwise connected, there exists a continuous function f : [0, 1] > 
X, taking f(0) =u, f(1) = v. But then there is a continuous function h(f(y)) taking a point y € [0, 1] to Xz such that 
h(f (0)) =h(u) = x and h(f (1)) = h(v) = y. Consequently, Xz is pathwise connected. 

(iii) Suppose X, is disconnected. Then there exist a set A # X; and A #9 which is both open and closed in X;. As in the 
proof of i), since this set is open, h(A) is open in X;, since the set is non-empty, h(A) is non-empty, and since A # X, and 
h is onto, h(A) # X2. Hence X; is also disconnected. 

(iv) Suppose X, is pathwise disconnected. Then there exist points x, y € X; such that there is no continuous function 
mapping [0, 1] to X, with f(0) =x, f(1) = y. Suppose there were a continuous function g : [0, 1] > X, such that g(0) = 
h(x), g(1) = h(y). Then, because h~! is continuous, the function h~'(g(w)) taking the interval to X, would take fe *(e(O) = 
x, h~'(g(1)) = y, acontradiction. Hence X; is pathwise disconnected. 

(v) Suppose X, is simply connected, and, because h is continuous and onto, let fi, fr: [0,1] > X, be two paths 
connecting points h(x), h(y) in X. Then there exist paths h—'(f,({0, 1])), h-(f({O, 1]) connecting x, y € X;. There is 
then a continuous deformation g(s, t) transforming h~'(f,([0, 1])) to h-'(f2({0, 1])). Then h(g((0, 1] x [0, 1])) > X2 isa 
continuous function, and 


h(g(s,0)) =h(h“'(fils))) = fils), 
h(g(s, 1)) =h(h“'(fals))) = fals), 
while 
h(g(0, t)) =h(h'(h(x)) = h(x), and 
h(g(1, t)) =h(h'(h(y))) =hQ), 


hence X, is simply connected. 

(vi) Finally, assume the converse, that X, is multiply connected. Then there exist two points x, y € X, and two paths 

fis f2 : [0, 1] > X; with no continuous deformation between them. We again transfer everything to X2. The points 
h(x), h(y) in Xz have the two paths h(f;({0, 1])), h(/2(19, 1])) connecting them. If there were a continuous deformation 
g :[0, 1] x [0, 1]  X, deforming one path to the other, then h~'(g({O, 1] x [0, 1])) — X; would provide a deformation in 
X,. Hence there is no such deformation, and X, is multiply connected. 
11.5.2 The metric space (@,Euclidean) is simply connected: Choose x, y and two functions from [0, 1] — @ such that 
fo(0) = f;(0) =x and fo(1) = fi(1) = y. Let g in the definition be given by g(s,t) =t- fo(s) + (i —1¢)- fils), where 
a(x, y) = (ax, ay). This satisfies the four demands of the definition, we must only check that for all values of (s, t) it 
remains within the square. It would be maximized in, say, the x coordinate by setting fo(s) = fi(s) = (1, y) in which case 
for any t the function would equal t + 1 — t = (1, y). Since this is clearly in the square, there is no problem there. The other 
cases are identical. Hence ™ is simply connected. 


11.5.3 To show that X = ((0, 1) U {2}, Euclidean) is disconnected, we examine the subset {2}. This set is closed since points 
are closed in metric spaces. It is also open: The ball B.(2) C {2} for € = 1/2 since by definition this ball is the set of all 
points in X which are closer than 1/2 to the point 2 of which there is only the point itself. Consequently, for every point in 
the set {2} C X, there is an open ball around that point in the subset, hence the subset is open. Since this set is both open and 
closed, and not empty or equal to X, the set is disconnected. 


11.5.4 Given the codespace metric we have used, for the space & on the N symbols 0,...,(N — 1), there is a metric 
equivalence to the subset of (0, 1] written in base N + 1 of real numbers with no N + l-ary digits equal to N. The 
equivalence is generated by the mapping x;x2...+> .x)X2.... The map has the same distance formula in both cases, and is 
clearly 1-1 and onto. The set of such real numbers is a totally disconnected subset of ([0, 1], Euclidean): Given any closed 
subset of the real line, either that subset is a single point, or an interval (possibly [a, 00)). If not, we may write the subset as 
the union of two closed disjoint subsets, and there is a distance between them say c. Then for any point in each subinterval, 
an €-ball around that point of radius c/3 contains only other points from the same subinterval, hence each subinterval is 
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open as well as closed and the set is disconnected. Since the set given above contains no intervals of with numbers with the 
digit N it is clearly not an interval, and it clearly contains more than one point. Hence codespace is disconnected. To see that 
it is totally disconnected, suppose there were an interval enclosed in the equivalent subspace of [0, 1]. Then if a = .x,x2... 
and b = .y,y2... suppose they agree to x; < y;. Then the point .x;x2...(z; + 1=N)... is in the interval [a, b] and not in 
the set. Hence there are no intervals in the set, and it is totally disconnected, by our metric equivalence, codespace is totally 
disconnected. 

11.5.5 Put the annulus in R? centered at the origin, take x, y on either side of the origin, and take fo to be a path which goes 
around the upper half of the annulus, f, a path which goes around the lower half. Suppose the function g(s, t) exists as in 
the definition. Then for each point fo(s), s #x, y we have fo(s)|, > 0 and the corresponding value for f(s) is negative. 
Holding s fixed the intermediate value theorem says that there is a value of t such that g(s, t)|, = 0. Since we may take the 
annulus as thin as we please, this says that every point must pass through either x or y as it is deformed from one path to the 
other. If some points pass through x and some through y there is a value of ¢ such that g(s, t) is not connected, hence is not 
a path. Hence the annulus is multiply connected. 

11.5.6 Choose the set of nested subsets of R? given by S; = R? B; where B; = {(x, y): -1 <x <1, y <i}. These sets 
are nested, they are all connected, and the intersection is the plane minus a strip around the y-axis 2 units wide, which 
is disconnected. 

II.5.8 The body space is multiply connected, it has a hole corresponding to the entire gastro-intestinal tract, if not others 
(e.g. pierced ears). 
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11.6.1 If x and y are elements of H(X) then by definition they are both non-empty and both compact. Their union is 
therefore non-empty. To see that it is also compact, let C be a covering of x U y by open sets. Then there is a subset of 
C which covers x, say C, and another C, which covers y. Because x is compact, there is a finite subset of C, which covers 
x, likewise for y. The collection of sets formed by taking all the sets in the finite subcover of x and all of those in the finite 
subcover y forms a subcover of x U y, and since its cardinality is the sum of two finite integers, is also finite. Hence C admits 
a finite subcover of x U y which is therefore compact. Since x U y is compact and non-empty, it is in #{((X). To see that the 
intersection of two elements in #(X) need not be in H((X), take x and y disjoint. Then x M y is empty and therefore not in 
H(X). 

11.6.2 The elements of #1(X) are compact non-empty subsets of X, while a compact non-empty subset of X is only a single 
point in #(X). An infinite collection of compact non-empty subsets of X is a valid subset of 7{(X), while it’s union may or 
may not be a compact non-empty subset of X. 


11.6.3 Let B,C € H(X) and B CC. Let x € X. Then 
d(x, C) =min(d(x, y)) 
yeC 


_ min(min(d (x, y)), : 2 aed y))) 


< min(d(x, y)) = d(x, B) 
yeB 


11.6.4 The distance from (1, 1) to the closed disk of radius } centered at (;,0) is given by the distance to the point on 
the boundary of the disk closest to (1, 1), since this point is not in the disk. The closest point on the boundary is on 


a radius of the disk, by the triangle inequality. This means the distance is the distance from (1, 1) to G, 0) minus the 
distance along this chord which is inside the disk, namely 5. The whole distance is J/1+(— 1/2? = ./5/4 so the distance 
d(x, B) = (V5 — 1)/2. 

11.6.5 In this case, the closed disk of radius ; looks like a square with corners at (0, 0), (0, 1), (1/2, 1/2), (1/2, —1, 2) 


since this is the set of points equidistant from (1/2, 0) at a distance 1/2. In the Manhattan metric, the distance to any point 
on the right upper face of this square is 1/2 + 1/2 = 1. Since the distance to any other face is larger, d(x, B) = 1. 


11.6.6 The elements of the set B are less than 3 only if n is odd. The sequence {aa} increases monotonically towards zero, 
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consequently it has its least value at 1/2. The least value in B then is the value x; = 3 — 1/2 or 5/2. The distance to the 
point x = 1/2 is then d(x, B) =2. 

II.6.7 It is easiest to provide a counterexample. Let B C A be a proper subset, such that there is an x € A with d(x, B) =1. 
(For instance two disks centered at the origin one with radius 1 one with radius 2). Then d(A, B) > 1. Since B C A the 
minimum distance from any point in B to A is to the point itself, namely 0. Hence d(B, A) = 0. The two are not equal, so 
this is not a metric on 7#{(X). 


1I.6.11 Let A, B € H(X), A # B. Suppose d(A, B) = 0. Then for every x € A, d(x, B) =0, since we take the maximum 
over these to form d(A, B). This can only occur if miny-g(d(x, y)) = 0, for every point x € A, which means that for each 
one, there is a y € B with y = x. In other words A C B. By the same argument, if d(B, A) is alsoO, then BC A.If ACB 
and B Cc A then A = B. This is not the case so one or the other distance cannot be 0. The converse of the above is that if 
A C B, then for each point in A, 


dk 2) = min(d(x, yj=d, O=0 
ye 


so that the maximum over these is 0. Hence if A C B then d(A, B) = 0. 


II.6.12 Using the solution to problem II.6.3, for each x € A we have d(x, C) < d(x, B). Suppose d(A, B) < d(A,C). 
Then there is a point x € A such that d(x, C) > d(y, B) for all y € A. This is a contradiction, choosing y = x. Hence 
d(A,C) <d(A, B). 

11.6.14 We write out the expressions on either side of the inequality. 


da(A, B) = max(min d(x, y)) 
xeEA yeB 


On the other side, we have 


d(A, C) + d(C, B) = max(min(d(x, z))) + max(min(d(w, y))) 
xEA zeCc weC yeB 


Since the maximum over w € C is greater than or equal to the value if w is replaced by z, we can write 


d(A,C)+d(C, B) = max(min(d(x, z)) + oe y))) 


For any choice of x € A and y € B the sum is greater than the distance d(x, y) by the triangle inequality, including the case 
where z has been chosen to minimize d(x, z) and where y has been chosen to minimize d(z, y). Hence for the choice of y 
the sum of the minima above is greater than d(x, y) for that choice of y and hence greater than the distance d(x, y) which 
is the minimum chosen to minimize d(x, y) directly. We may then write 


d(A,C)+d(C, B) = max(min(d(x, y)) =da(A, B) 
xe€A ye 


For the other case, whether or not d(A, B) < d(C, A) + d(C, B), we use the case C C B C A. Then the right hand side is 
identically zero, while if A + B, the left hand side is not. Hence the expression is not true in general. 


11.6.16 We make extensive use of II.6.12. To this end, we have 
h(AU B,CUD)=d(AUB,CUD)vVd(CUD,AUB). 


Ifh(AU B,C UD) =d(AUB, CUD) then either h(A U B, CU D) =d(A, CUD) orh(A UB, CUD) =d(B, CUD). 
Choose the former. then by II.6.12, 


h(AU B,C UD) <d(A,C) <h(A, C) <h(A, C) V ACB, D). 
If we choose the latter, 

h(AU B,C UD) <d(B, D) <h(B, D) < h(A,C) V ACB, D) 
Ifh(AUB,CUD)=d(C UD, AUB) theneitherh(A UB, CUD) =d(C, AUB) orh(AUB,CUD)=d(D, AUB). 
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Choose the former. then by II.6.12, 

h(AUB,CUD) <d(C, A) <h(A, C) < h(A, C) V ACB, D). 
If we choose the latter, 

h(AU B,C UD) <d(D, B) <h(B, D) < h(A,C) Vv A(B, D). 


Since this is all the possibilities, the result follows. 


Answers to Chapter Il, section 7 


II.7.3 The reasonable assumptions we could make are that each image is a set of points characterized by spatial coordinates 
and color coordinates. The spatial coordinates for a photograph are in two dimensions and form a filled square, they are thus 
compact. If we assume that the colors are a closed and totally bounded region, say grey scales from 1—100, then the product 
space of pictures is compact. We assume some continuity in time, this is backed up by physical assumptions, the tree will 
probably not do something totally disconnected from its motion in the interval before. Finally, we assume that the tree waves 
at finite speed. This implies that the change between photographs in picture space will decrease as the time interval shortens. 
This should yield a Cauchy sequence given that the distance between two photographs is the Hausdorff metric, applied to 
successive pictures as sets of points with (x,y,color) coordinates, and, say a Euclidean metric on this 3-space. A looks like a 
photograph of the tree at 1 second. 

11.7.4 In the space of non-empty compact subsets of the Sierpinski triangle, an example of an infinite set is (among others) 
the set of all singleton sets {x} with x € A. A Cauchy sequence in this set is given by successive righthand corners of 
triangles with lefthand (lower) corners at the lefthand lower corner of A. This sequence converges to the lefthand lower 
comer of A, the convergence implies that it is Cauchy. 

11.7.6 If X is complete, then so is 7((X). Since this is the case if X is compact, it remains to show that 7{(X) is totally 
bounded. Let € be given, and let {y,, ..., y,} be an €-net for X. The singleton sets {y,}, ..., {y,} are elements of H(X) since 
such sets are compact in a metric space. Let A € #{(X). The distance h({y;}, A) is given by 


d(yi, A) ¥ d(A, yi) 


where by definition, the first distance is the distance from y; to the closest point in A, and the second is given by the 
maximum over points in A of the shortest distance from one of those points to y; that is the maximum distance of a point to 
y; SO it is clear that this set of singleton sets does not in itself form an €-net of 7((X). However, since the collection is finite, 
its power set is also finite (the collection of all possible sets made from the points y;. Take that union of singletons such that 
the distance from each of the singletons to A is less than € and call this set Y. This certainly exists, since each element of A 
is closer than € to some y,;. Then the first element of the above formula is less than €. So is the second. For every element of 
A the distance to one of the elements in {y;} is less than € consequently d(a, Y) < eVa € A. Hence 1{(X) is totally bounded 
and hence compact. 


Answers to Chapter Ill, section | 
III.1.1 For all x € X we may write 
mcopies 
f™ 0 f"(x) = f(FC-- FCF" @)) +) 
mcopies ncopies 
= f(F(-- FCF CFC-: F@)---))---))) 
m-+ncopies 
ai ft lz). 


TI1.1.2 Let f :R —> R with f(x) = 2x. Let f(x) = f (x2). Then 2x; = 2x2 which implies that x; = x2 so f is one-to-one. 
Let y € R. Then for any y, 1/2y € R, so that f(1/2y) =2(1/2y) = y. Then f is onto. Since f is both one-to-one and onto, 
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it is invertible. Each iteration multiplies the result of the previous iteration by 2. Hence we have 
7") =20C-< Cxr)-*3))=2'x. 
ee’ 
ncopies 


Similarly, 


J = 5655 = G0 +) = 2X. 
ncopies 
Thus for any integer, we have f”(x) = 2"x. 
11.1.3 Let f : [0,1] — [0, 1] with f(x) =1/2x. f is one-to-one since 1/2x; = 1 /2x2 => x; = X2. But f is not onto, since 
f(x) =1>x =2¢ (0, 1]. Hence f is not invertible. 
111.1.4 Let f : [0,1] —> [0, 1] with f(x) =4x(1 — x). Taking the first derivative, we have f’(x) = 4 — 8x, which has a 
zero at x = 1/2 which is a maximum. The value of f there is 1. The minimum value must be at one or the other or both 
endpoints. The function takes the value 0 at both endpoints. Hence the function is onto. It is not one-to-one however, for 
example f (1/4) = f (3/4) = 3/4. 
11.1.5 The first thing to show is that f(x) € C if x € C. We demonstrate this by noting that if x € C then x is the endpoint 
of some closed interval of the form either [(2"3”)/(3*), (2"3" + 1)/(3*)] or [(3” — 1)/(3*), (3")/(3")]. Hence f raises the 
power of the denominator by 1 and is still of this form. So let f(x) €C. f is one-to-one since x,/3 = x2/3 > x; = X2. It 
is not onto, since 1 € C and x/3 = 1 > x =3 ¢C. To find another transformation, note that if x « C then 1 - x € C. So 
f(x) = (1 — x)/3 is another such transformation. 
11.1.6 Let f : R? > R? with f(x, x2) = (2x1, x3 + x1). f is not invertible because it is not one-to-one. Let y= f(x) = 
f (x1, X2). Then y = f (x;, —x2) as well, since x2 appears in the expression for f only in squared form. 
11.1.8 An affine transformation applied to an interval takes it to another interval, possibly reflecting it (transposing the 
endpoints in interval notation), hence 
, 2a ifa > 0, 

FOL 2D=all.21+0= Teh a te] taco. 

To map into the interval we must have 
l<a+d<2, 1<2a+b<2 


The new length of the interval will be a, so that 0 < a < 1, with a = 0 mapping the interval to a single point. 
If f, g:[1, 2] > [1, 2] then f({1, 2]) Cc [1, 2], hence 


go f(ll,2pc ei, 2) C [1 2) 
fog(l, 2) c fl, 2) c U1, 2] 


so that the compositions also map [1, 2] into itself. 
We now want the conditions such that 


fog(X)Ugo f(X)=xX 
Let f(x) =ax +b, and g(x) =cx +d. We make the simplifying assumption that f o g(X) maps the interval to the lower 
end of itself, and g o f(X) to the upper end, and the two overlap across the middle at at least one point. We write out the 
compositions explicitly: 
f og(x) =a(cx +d)+b=acx +ad +b; 
go f(x) =c(ax +b) +d=acx+bc+d. 


The contraction of the interval is the same in each case, and is equal to ac. In order for the two maps to overlap, then, the 
length of each interval must be at least 1/2. We therefore have 1/2 < |ac| < |. There are two cases: If ac > 0 then f og 
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must map the interval to [1, 1 + ac] and g o f must map it to [2 — ac, 2]. We have 


ac+ad+b=1 
2ac+bc+d=2 
ad+b=1-—ac 


bc +d =2 —2ac 
If ac < 0 then f o g must take 1 + 1 — ac and 2+ 1. Similarly g o f takes 2+» 2 + ac and 1 b 2. This gives 
2act+ad+b=1 


ac+bc+d=2 
ad+b=1-—2ac 
beo+d=2-—ac 


Given two values for a, c satisfying 1/2 < |ac| < 1, these equations allow calculation of b, d such that the above condition 
is satisfied. 


1H1.1.10 1, = f”(Io). Consequently, we have f (/,) = f(f"Uo)) = f n+1(Jo) = I,4,. We are given that J] = U%,], is a finite 
connected interval, hence 

iain = G aashe Boer = Bae ? =! \ Io 
Then the length 2 = length(/y) + length( f (7)). The length of f (J) is the amount that it was contracted, hence is equal to 
af. This is equal to the length of J \ Ip = 2 — b. This gives af = £ — bor £=b/(1 — a). This yields 


co 


= a 
yb-a Was 


n=1 
III.1.11 Let f, g : R — R be polynomial transformations. Then if 
f(x) = ap tayx +++» +ayx™ ; g(x) = bo +b, +--> + byx™ 
we have 
f 0 g(x) =a + a1g(x) + a2(g(x))’ +--+» + aw(g(x))” 
If we expand g(x) in the last term in the expression, the term with the higest power of x is ay(byx™)”. Since by definition, 


an, by #0, ayb¥, £0 so this is a polynomial transformation of degree M - N. Using this, the degree of f(x) is N”. 


I11.1.12 For n > 1 a polynomial function f : R — R of degree n is generally not invertible. A quick example is any 
polynomial having a; = 0 for all odd i. More precisely, suppose that f(x) is invertible. Then there is exactly one number b 
such that 


Ga BD ee ue == 
because the function must be one-to-one. This polynomial must then factor into the two terms 
(x — b)(co beyx Hees Hep" |) = an +++» tayx™ 


The polynomial cg + ¢)% +-°-+ cy—1x%—! either has no zeroes or it has only one at x = b. Assume not. Then by adding or 
subtracting from co, we can force it to have a zero, thus creating a new function with two roots. Such a function is then not 
one-to-one and therefore not invertible. 

I11.1.13 For large enough x, |x| >> |x¥~| >> ---. So for large |x|, f(x) looks like ayx™ which, for |x| > 1 is expanding, 
since |x/ — x3| > |x; — x2]. If J is an interval {x : |x — a| < 5} for fixed a,b € R, then if M > 0, L£(f ))/f UW) > M for 
b> B. Let 1 =[a — b,a + 5]. Construct the polynomials and subtract. 

I1I.1.15 Parabolae opening up or down will work. For example f(x) = a(x — 1)? where a <2. 
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III.1.16 The function is continuous. f,(0) = 0 for all 4. It is maximized by setting the derivative equal to zero, as follows: 


fx(x) = 2Ax - Ax? =A(2x — x?) 
fix) =A(2 — 2x) 


so that f{(x) = 0 when x = 1. This is a maximum, since the second derivative is negative, and has the value f,(1) =A. 
Hence if 4 € [0, 2] then f,(x) : [0, 2] — [0, 2]. The fold occurs at (1, A). 
I11.1.17 To be a fold point of f(x), x must be a local maximum or minimum. If f(x) is differentiable, the condition for 
this is f’(x) =0 and f”(x) 4 0. Polynomial transformations are differentiable. If f”(x) = 0 we have an inflection for some 
functions, (for example f(x) = x*) so no fold exists at x. 
III.1.21 To show that the M6bius transformation is invertible, we show that it is one-to-one and onto, as follows: 
one-to-one: 
Let f(x,) = f (x2). Then we have 


ax, +b a FD 
ex; +d cen +d 


which yields 
ACX|X2 + bcx2 + adx, + bd = acx\x2 + bcx, + adx2 + bd 
or, bcx2 + adx, = bcx,; + adx>. That is 
(ad — bc)x, = (ad — bc) x, 


Since the ‘determinant’ ad — bc is assumed to be non-zero, x; = x2 and the function is one-to-one. 
onto: 
Let y € R, we need to find x such that 


_ axt+b 
oe ed 


We have 

cxy+dy=ax+b 

cyx —ax=b-—dy 

(cy —a)x =b—dy 

iy 
Pipe cas 
cy-—a 

Since ad — bc #0 we have (—a)(—d) — bc #0 so this is another Mobius transformation, and f is onto. 
I1I.1.22 We want to show that if f;, f. are Mobius transformations then f; o f is. Writing 


_ ax+d _ ax +b 


ie (3m te 


we make the necessary substitutions: 
anx+b 
Qi etd, + Pl ay(agx + bp) + bi(Cox + dy) 


one 4d, Ci(a2x + by) + dy(cox + dp) 


_ (a2 + bic2)x + (a,b. + bi d2) 
(cya2 + d\c2)x + (boc) + d)d2) 
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which has the form of a Mobius transformation. We need to check: 
(a\a2 + biC2)(b2c; + didz) — (ayb2 + byd2)(C1a2 + d\C2) 
= (ajazb2c, + a\a2d\d_ + b,c2b2c, + b\c2d\d2) 
— (@,b2C\a2 + a,b2d\c2 + b\d2C\a2 + b\d2d;C2) 

= (€1@2d\d2 + biC\b2C2) — (aid b2C2 + a2d2b1C)) 

= b2¢2(bic; — ad) + a2d2(a\d; — bc) 

= (adz — b2C2)(aid; — b,c) FO 
because f;, f2 are Mobius transformations by assumption. 
111.1.23 The action of the transformation f(z) = 1/z on the Riemann sphere is to flip it over. It takes 1 > 1, -1+ —1 so 
there is no rotation, and 0 +> co, co +> 0 which inverts the two poles. 
111.1.24 When f{ (00) = co then a/c — 00, since a < 00 we have c = 0. Conversly, if c = 0 then f (00) = 00 by definition. 
Hence if 00 +> oo then the transformation looks like 


which is an affine transformation. 
1H1.1.25 f :R— R takes 1 +> 2,2++ 0 and 0+ oo. So we have d = 0 and the function looks like 


ax+b 
cx 


f(x) = 


1 +> 2 implies (a + b)/c =2. 2+ 0 implies 2a + b = 0 or b = —2a. This yields —a/c =2 so that f(oo) = 2. We can 

only determine the transformation up to a constant (that is, multiplying top and bottom by the same constant yields another 
identical transformation) so set c = 1. This gives a = —2, b = 4 so the Mobius transformation 

—2x +4 

f(x) = 


will do it. 


Answers to Chapter Ill, section 2 
III.2.1 Let the affine transformation be denoted, as usual, by 


i) Gar) 


Since (0, 0) is mapped to (4, 5), we have e = 4, and f =5. From the relation (0, 1) +> (—1, 2) we have 
—1 a b 0 4 
(2) = (6 a)Q) +3) 
—1 b 4 
(7) =(2)+(5) 


which gives b = —5 and d = —3. Finally, using (1, 0) +> (3, 0), we have 


(0)=(¢ 3) (0) +(s) 


or, 


Or, 
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which gives a = —1 and c = —5. The function then is 


G)=(s 3)G)+(5): 


11.2.2 The affine transformation will be invertible if the matrix A is invertible, since if y = Ax +t, then A~'(y — £) yields 
x again if A~! exists. Let A and A~' be denoted by 
a b 
=(¢ 4) 


ma a) 
CF i)( a)=G 1) 


Then we have, by definition, 


Or, 
ea+ fec=1 
eb+ fd=0 
ga+hc=0 
gb+hd=1. 
Solving the first two for e, f, 
igen E! 
ae 
—fad+ fbc=b 
or 
—b d 
t= aa Oe 
Solving the second two for g, h, 
—he 
i. 
a 
—hbc +had=a 
or 
os =e ° — a 
e~ aetA” det A 


Putting these together, we get 


ie a a Ss 
~ dettA\—c a 


Consequently, A! exists if and only if det A £ 0. If it does, then if (x1, x2) + (y1, y2) we have 


(2) =aaalGy Mar) 
x.) detA\—c a yo — f 
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11.2.3 Let fi, fo: R? > R? with 
and 


Then we have 


fh=nohn=(* 5)((S &) 


_ [{aa'+be' ab'+bd'\ (x 1 ae +bf +e 
~ \ca’'+dc’ cb'+dd')\ y ce’'+df'+f 
Letting 
a” =aa’+be' 
b” = ab’ + bd’ 
ce" =ca'+dc' 
d" = cb’ + dd’ 
e”=ae'+bf'+e 
f’ =ce'+df'+f 
then, 


which is affine, as desired. 
T11.2.4 Let 


Then we have 
ee (¢ ;) (; £) = (Sik een 
c ay\e fh ce+dg cf+dh 
The products subtracted to form the determinant det(AB) are given by 
(ae + bg)(cf + dh) =aecf + aedh + bgcf + bgdh 
(af + bh)(ce + dg) =aecf +afdg + bhce + bgdh 
Subtracting these yields 
det(AB) = aedh + becf —afdg — bhce 
=ad(eh — fg) — bc(eh — fg) 
= (ad — bc)(eh — fg) 
= det A - det B 


as desired. 
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III.2.5 The easiest way to visualize this is to break the desired transformation into two transformations, one which takes the 
triangle to the line between (0, 0) and (1, 1) and one which takes this line to the line between (2, 2) and (1, 1). The second 
transformation is an inversion and translation: 


G)=(6 3))+(5) 
(1)=(2 @)G)+@) 


which gives a + b—2 = 1 andc+d —2 = 1, which by letting b = c = 0 gives a = —1 andd = —1. The first transformation 
takes the triangle to the line between (0, 0) and (1, 1). (0, 0) + (0, 0) so that e = f = 0. We have 


()=(e a) 
()=(¢ a) (0) 


so that a = c = 1. Multiplying the two together, we have 


(Co 2)G G)+G)-G A)G)+() 


To put this in scaling ratios and rotation angles, let 


which gives e = f = 2 and 


so that b = d = 1, and 


ntnm=v2 


Then the needed rotation angles are 


—|] 
cos 6; = sin@ 1 = cos 62 = sin @, = —= 


V3 


v=(¢ a)(a)*() 


Since 3 is a translation term, and translation does not affect the area of any transformed polygon, without loss of 


which gives 6; = 6, = >. 


T11.2.6 We start with the hint. Let 


generality e = f =0. As well, it does not change anything to assume that one vertex of the triangle is at the origin. A 
triangle is then determined by specifying two points (x), x2) and (y;, y2) such that (y1, y2) # @ - (x), x2) (which makes them 
non-collinear), and neither is the origin (which makes less than three distinct vertices, hence no area). First, show that the 
area of the parallelogram in figure 2.6 is given by 


area = |x, y2 — X2)| 


(this is the length of the vector in a vector (cross) product). To see this, rotate the parallelogram until (y;, y2) becomes 
(y;, 0). Then if the new coordinates for (x;, x2) are given by (xj, x3) the area of the parallelogram is the base times the 
height, or y}x,. The rotation is —@ with 


yi cia 2 


Vy +y2- Vi + y¥3 


cos @ = 
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Figure 2.6. Getting 
the area of a triangle from 
a parallelogram 


We have: 
cos@ sing X,\ _ { x, cos@ + x2 sin@ 
—sind cos@)\x.] \x.cos@ —x,sin0 
cos@ = sin@ yi \ _ fy cosé + y2 sind 
—sind cos@)\ yy.) \ y.cos@ — y, sind 
so that 


y,X5 = (y1 COSA + y2 sin 8)(x2 cos @ — x, sin) 
(y? + ye) yix5 = yixey? — yxy 2 + Yox2ViX2 — x1 y2yF 
= y?(x2y1 — x12) + yo (xay1 — X12) 


yielding |y;x{| = |x1y2 — y1x2| as desired. The area of the desired triangle is then 5 |x; y2 — y2x1|. Apply the transformation 


e(a)=(¢ a)() 
x2 ccd x2 
to each of the two vectors. Then the new area is derived by 
X;=ax;+bx, = y, = ay, + byr 
xX,=CX%+dx; y,=cytdyr 
yielding 


1 
5 lacx,y; + adx,y2 + bex2y, + bdxzy2 — acx,y, — bex,y2 — adxzy, — bcxry>| 
1 
5 \(ad — bc)x,y2 — (ad — bc)x2yj| 


] 
= 7 det Al|xiy2 — x2y1| 


A polygon (and many other surfaces) can be broken into triangles on each of which this formula applies, which finishes the 
proof. To see that det A < 0 corresponds to reflection, write 


e((o i)(¢ a))=-#e( a) 
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Note: The vector/cross/wedge product used to do the proof this way can be generalized to be applicable on other types of 
surfaces and hypersurfaces. The generalized version preserves the form | det A|-(“area’) to such surfaces and volumes. 


111.2.7 Let w(x) = Ax +t be an R? similitude. Then w has one of the forms: 
x \  freos? —rsin@ \ / x; e 
w(%) “(ss rcos@ YEG) 
x1\ _f(rcos@ rsin@ x e 
w(*) = ee platy |e BI 8 
The first can be rewritten as 


a cos@ —sin@ x ce). x1 e 
o(S)=r(Ste coe )(2)*(5)=7%(2)+(5) 
By making the substitution @ = —0 the transformation 
yy cos@ —sin@ 
c= E == | ) ( sin@  cos@ ) 

__f cost? —sne 

~ \—sin@ —cosé 

_(cosd@ —sing 

~ \sing@ cos¢d 
so with this substitution, the second form for a similitude becomes 


Xj Xj é 
w =rRR + 
(::) =r (2) +(3) 
which has the proper form. 


II1.2.9 There are many possible ways to write a similitude w(x) = Ax +1 which will take the Sierpinski triangle to a subset 
of itself, and we give the simplest one. To take the triangle to the subtriangle with corner (0, 0) we note that by taking 


(0, 0) + (0, 0) 
1 
1, Or (5,0) 


1 Ll 
we have completely determined w. The first transformation implies that 
ee 0 
=U ria 


For the other two, 


so thata = 3 andc =0. 


so that b = 0 and d = 3. Then w looks like 


Selected Answers 439 


III.2.10 The simplest way to show this decomposition is to multiply it out. 


r, O cos@ —sin@ r; O 
0 n sinO cos@ oe 
_f{ricos@é —r,sin@\/r3 0 
ie 2 sin 0 rz COS 6 0 r 4 
_ {mr3cos@ —r,r4sin@ 
~ \ rorgsin@  rergcos@ 


This is a general affine transformation because the four terms 7173, 7174, r2r'3, and r2r4 are linearly independent. This means 
that none of these can be written as a linear combination (sum with constant coefficients) of the other three, hence any 
arbitrary value can be written this way. It is more important to understand what is going on geometrically, which is done 
by looking at the action of this transformation on the coordinate axes. The transformation by r3, r4 rescales the coordinates, 
that is stretches or shrinks each coordinate and possibly reflects one or both. The rotation matrix rotates the coordinates 
by some angle 6, leaving the rescaled coordinates off axis. The action of the second rescaling has two functions. It further 
stretches or shrinks the rotated axes, but because they are no longer lined up with horizontal or vertical, it also changes 
the angle between them. This has the effect of skewing the new map of the plane. We are left with a transformation which 
translates, rotates, rescales, reflects and skews. This is the most general affine transformation, and requires all three parts of 


the decomposition to accomplish it. 
111.2.12 We multiply the two matrices to get the identity matrix: 


b d —b ad—bc ba—ab 

a det detA \ —|[ deta det A 

’ 2) ( —c a ) = ( cd—dc —bc+ad ) 
det det A det A det A 


> 


> 


which proves the statement. 
117.2.13 Let 


By 
ll 
Yi 
2 8 
a 
Sisal 


By exercise III.2.12 we have 


Then we have 


440 # £Selected Answers 


det B det B 
gd—hc ah—gb 


det B det B 
aed—afc+bgd—bhc a2 —aeb+bah—gb* 
sais det B det B 
ced— fc*-+gd2—dhe caf —ceb+dah—dgb 
det B det B 


which gives 


1 
tr(BAB™') = —y (aed — afc + bgd — bhc + caf — ceb + dah — dgb) 
1 
2s = _ 
aa 8 (ade — bce + adh — bch) 
det B 
as qa pet) =trA 


The other equation is somewhat simpler: 


det(BAB™') = det B det A det B~' = det B det B~' det A = det(BB™') det A 


Now BB™! = / the identity matrix, and det J = 1, so 
det(BAB~') = det(BB~') det A = det A. 


II1.2.14 We simplify the problem by noting that if g is any scalar, 


1 
1A@*)| _ [ee ee 
lg q?(x? + x2) 


2 28 2\ 3 
= (2 (ax, + bx2)* + q*(cx; + dx2) ) 
q?(x? + x3) 
{Ax 
|x| 

consequently, it suffices to maximize over x such that |x| = 1, that is, over the unit circle. For the Euclidean metric then, we 
let x; = cos @ and x2 = sin@. Then recognizing that the derivative will be zero only if the derivative of the quantity inside 
the square root sign is zero, we maximize 


0= F((acosé + bsin 6)” + (ccos@ + dsin6@)’) 


= 2(acos6@ + bsin@)(—a sin@ + bcos 8) 
+ 2(ccos@ +d sin 9)(—c sin@ + dcos 6) 

= —q’ sin 6 cos 6 + abcos* 6 — ab sin* 6 + b* sin @ cos 6 
—c’sin6 cos @ + cd cos’ 6 — cd sin’ 6 + ad? sin @ cos @ 


bY? — a* 24 ¢7 


d 
sin 26 + abcos26 + sin 26 + cd cos 20 


0 = 2(ab + cd) cos 26 + ((b? + d*) — (a? + c’)) sin 26, or, 
((a? + c*) — (b* + d?)) 


20 = 
= 2(ab + cd) 
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This last equation has (in general) four solutions. Two are minima, two are maxima, and the norm exists. The unit circle has 
been transformed to an ellipse, the semi-major axis of which is the norm of A. 

For the Manhattan metric, we again can restrict to the unit circle which in this metric is the set of points (x,, x2) such that 
|x; + x2| = 1, and is a diamond centered at the origin, with vertices on the axes at (+1, 0) and (0, +1). By looking at the 
Euclidean case, we can see that the transformed version of this figure under A will contain two maxima and two minima. 
From exercise JI.2.9, this transformation is a composition of a coordinate rescaling, a rotation, and another coordinate 
rescaling (and possibly a reflection). The result is again a parallelogram centered at the origin. The triangle inequality will 
now tell us that the maximum is at one pair of vertices. These are the images of the original vertices, hence one of these two 
images is the maximum. We have 


A(X1, X2) = (ax, + bx2, cx; + dx2) 


at the vertices we have |A| = max((a +c), (b+d)). 


Answers to Chapter Ill, section 3 


III.3.1 (i) The most general Mobius transformation such that f (00) = oo is of the form f(z) = az + b. To show this we 
need to show that if f(co) = oo then c = 0. Suppose not. Then for c # 0 we have f(oo) = a/c # 00 because a is finite. We 
thus have 


az+b _ adz+bd 
Be Age 


Define a’ = ad/|d|? and b’ = bd/|d|?. then f(z) = a’z + b’ as desired. 
(ii) Suppose f : R* > R? is a two dimensional similitude without reflection. The we may write f as 


x cos@ —sin@ x e 
4 8 rl oer cos 6 \G)+G) 
Let z=x+iy,b=e+if and a=a, + ia. We have az = (a,x — any) + i(a;y + a2x) which we can write using real 


coordinates and matrix notation as 
a, —a\_,(% ) 
a a, y 


Letting det A = r? (det A is positive since we presumed no reflection) and 0 = cos~!(a;/r) we have our result. Since we 
divided out det A the determinant is 1 and with a = d and b = —c, the new determinant must read cos” 9 + sin*@ = 1, asa 
check that the new values match up. 


f(zZ= 


III.3.2 We wish to prove that the operation f(z) = . can be expressed as mapping the complex plane with infinity, to a 
sphere of radius 1, with equator at |z| = 1, rotating the sphere by z around the real axis (the axis between -1 and 1), and 
mapping the result back to the plane. We first establish that for at least some points this is correct. 

(i) The results of this mapping on the points O and co would be to exchange them. The map f(z) = t does indeed 
exchange these two points. 

(ii) The result on the equator is to flip it about the real line, and map it back to itself. Thus every point on the equator 
\z| = 1 is mapped to the equator. If we were to look down from above, the result of the rotation would be to exchange the 
halves of the equator above and below the real line, that is, each point on |z| = 1 should go to it’s complex conjugate. We 
have 


1 Zz 
f@=-=—a 
.. Ee 
For points on the equator, the denominator of the last expression is 1 and each point does indeed move to it’s conjugate. We 
note in passing that having specified the action on these many points, as well as that we are using a Mobius transformation 
is in fact enough to prove the assertion, but we will perform the explicit map. 
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(iii) We need a coordinate system to represent points on the sphere. Since the map used to generate the Riemann sphere is 
to map each point to the point on the surface where a line connecting the north pole to the plane point intersects the sphere, 
we choose the angle @ between this line and the diameter of the sphere connecting the poles (interior angle). This gives us 
a “latitude” coordinate, for a longitude, we take the polar coordinate angle of the plane point, with respect to the positive 
real line. Expressing the point z in the plane as re’®, this amounts to choosing 6. We need the coordinate transformation for 
@. The line from the north pole to the point z forms the hypotenuse of a right triangle, the side adjacent is the radius of the 
sphere to the north pole, and the side opposite is the plane vector to z. Consequently, we have 


r=(|z|=tang 
We may thus express the coordinate transformation as 
= 2, + iz. =tan@cosé +i tangsin#d 
or more simply as 
z = tan ge’’. 


How do these coordinates change when we rotate the sphere? As with the equator, the 0 coordinate becomes the negative 
angle with the positive real line, that is 9 +> —@. For @ we need some plane geometry. First, we note that if we have a 
diameter of a circle in the plane, and a point not on that diameter, the triangle formed by the chords between the endpoints 
and the third point is a right triangle. To see this, add the radius to the third point to divide the triangle into two isoceles 
triangles, and sum up the angles at the center of the circle. Second, we take the same planar circle, and draw the lines 
connecting a point on the perpendicular bisector of the diameter to the endpoints. If we now draw a line from one of the 
ends of the diameter to the point at which the line from the other end to the bisector intersects the circle, the triangle so 
formed is similar to the two triangles formed by the lines previously drawn to the bisector. Consequently, given that the 
tangent of the angle from the north pole to a point on the sphere is tan @, the tangent of the angle formed by the south pole 
is cot d. 

When we rotate the Riemann sphere around the real line, we have exactly this situation, namely that the angle @ of the 
original point on the sphere is now the angle measured from the south pole. Consequently, the tangent of the new angle to 
the north pole is cot ¢. We now look at the action of f(z) on z = tan ge’’. We have 


so the spherical coordinates have taken 6 +> —@ and tang +> cot ¢ as desired. 
Ii1.3.3 If f(z) is not a similitude then, by problem 1, c 4 0. We may thus let 


a 
e=— 
Cc 
ad — bc 
——— 
td 
s=— 
¢: 


az+4q (S#%) 
2 cztd czt+d 
az+b 
~ ezt+d 
= f(z) 
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as desired. 
II1.3.6 We wish to show that a MGbius transformation f : C > Cis always invertible. We divide the problem into it’s two 
cases: Let 

az+b 

cz+d 


(i) Suppose c = 0. Then the Mobius transformation can be rewritten as f(z) = a'z +b’. Let w = f(z). Then the inverse is 
given by 


f(z) = 


L= 
a 


(ii) Suppose c # 0. Then the transformation can be rewritten in the form 


i 
i 
f@) ee 
Let w = f(z). Then the inverse may be written 
Z=st f 
w—e 


III.3.7 See answer to exercise 1.22 in Chapter III. 


III.3.8 We wish to transform the real line to the unit circle in C. This means taking z = z; + iz2 such that z. = 0 to |z| = 1. 
Looking at the Riemann sphere, this can be accomplished by rotating the north and south poles to the equator. Specifically, 
we take 00 +> i and Ot» —i. Since we need one more point to define the transformation, we may do this by insisting that 
we rotate about the axis between 1 and —1 by taking 1 +> 1. We have 


e a ° 
COr> 1 => -=!1 
¢ 


b 
ee 


a+b 
1 l= = 
c+d 
From this we get 
a=Ic 
b=-—id 
a+b a 
ae 
a+b=c+d 
ic-—id=c+d 


Choose a = 1. Then we have 
zt+i iz—1 
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III.3.9 We do out the first few terms of f"(z) to see what we are dealing with: 


PO=: 
1 

1 an ssieansiansins 
IOs 1+z 

2 mS 
f@= Eee 

: 1 
i eee pias 

ree | 


This object is known as a continued fraction and is usually written as 


[do, 0), ..<, 0) 6g ee — 


There is a whole body of mathematics to deal with these interesting creatures, they can be used to represent numbers, 
functions, traces of matrices, and other things. We need only the following relations: Let pp = ao, pj = a\€9 + 1, px = 
Ay Pke-1 + Pr-2 aNd qo = 1, qi = 4), Qe = Ae Qu—-1 + Ge—2- Then the following relation holds: 


is i. ale 
qk 


For our function, we have 
F@O=0.4 1:.:, 1,2] 


with n one’s implied. If we look at the growth of the terms p; they represent (for terms before the n" term, the terms in a 
Fibonacci series. That is, let Fy = 1, F; = 1, F,, = F,_1 + F,-2. The first few terms look like 


L, tee G, to; ss 


Since po = 0, p; = 1, p2 = 1, p3 = 2, and so forth, we may write p, = F,-1, for k < n. Similarly, we may write q, = F;, for 
k <n. Then f"(z) may be written, using the above relations, 


fr(g) = Se 
For the inverse transformations, we look at 
w= = =>z=-l+ : 
+z 


Writing out the first few iterations, we have 


fw) =-14 = 
Ww 


—2 a: 
f-"(w) = Bee oe’ 


i 
w 
Again we have the form of a continued fraction, this time 


f-"(w) =[-1, -1,-1,...,-1, w] 
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Using exactly the same kind of analysis as before, we have 


Po=—-1 ; gq=!1 
Ppa=2 3; g=-l 
Po=-3 3 Q=2 


=F: HET; 


which gives 
— = wF, + F,-1 
f ais 2 wF,-1 + F,-2 
III.3.10 We want the geometrical interpretation of the map 
1 
f(z) =i + — 
t=1 


To get this we first note that by rewriting 


1 iz+t1l+1 iz+2 
{@=i¢— = = 
ae at z-l 


we have f (co) =a/c =i, f (i) = 00. This suggests we first translate the plane to bring the point i to the origin. For this we 
have 


O(z)=z-i 
6-"(z) =z+i 


1 1 
ao foo) = (i+ . )-i=s 
Z+i-} Z 


Consequently we can describe the mapping as translating the plane up by one unit, mapping the plane to the Riemann 
sphere, rotating the Riemann sphere about the real line, and mapping back. The net effect is a translation and an inversion. 


Answers to Chapter Ill, section 4 
11.4.2 To be correct inverse functions for f(z) we must verify that f(f -1(z)) = z. For points in the upper half plane, we 
have 
f (a(x, X2) + ib(x1, x2)) 
= (a* — b*? — 1) + 2iab 


JV (1 +41)? +43 +1441 V(l+%1)? +43 -1l-xy 2 
2 


EES SS SS | LS 


a 2 2 


3 Vadtx)?+x3 +144 J +x)? +23 -1-—x, 
V 2 2 
a, (d+) +23 - (+41)? 
= 4 


=X, + 1x2 =2 


so that w, does indeed invert f(z) there. w2 differs from w; in the above derivation only in the imaginary part which replaces 
2iab with 2i(—a)(—b) and hence also inverts f(z). Finally, for the lower half of the plane, we notice that we are in need 
of a negative sign preceding the imaginary term, since the positive square root is indicated. This is provided by replacing a 
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by —a. This has no effect on the real part, because a appears there only as an a” term, completing the test for the inverse of 
gas 
11.4.3 Let f(z) =z? + 1. The critical points are those values of z such that f’(z) = 0, that is, the first derivative vanishes. 
f(z) = 2z = 0 when z = 0 so this is the critical point of f(z). The critical values of f(z) are by definition the range values 
at which f’(z) vanishes, that is f(z) such that z is a critical point. This occurs at f (0) = 1 so 1 is acritical point for f (z). 
III.4.5 We determine the local approximate affine transformations for these functions by expanding about Zo in a Taylor 
(Laurent) series for f(z), and restricting to the first two coefficients ap and a). 

(a) f(z) =z’ and z = 1. ag = f(1) = 1. For the linear coefficient, we have 


ap ve 


a, = T 2 


Letting w(z) be the linear approximation, we have 
w(z)=14+2(2—-—1) =2z-1 
(b) f(z) = + and z =1+i. We have 


1 
ao = f (%) = ee 
a ae 


Using the same notation as in (a), we have 


ee eee ——— (z — (1 + i)) 


1+i ay 

1 —Z 1+i 
Tin i4t Aree 
é es 


(c) f(z) =(z — 1)? and z = 1 — i. We have 
ay = f (zo) = ((1 —i) — 1)° = (-i)? =i 
= f'(%) = 3(% — 1)? =3(-i =—- 
Putting these together gives 
w(z) =i + —3(z — (1 —i) 
=i+3-—-3i —3z 
= —37+3—-2i 


Answers to Chapter Ill, section 5 
II1.5.3 Let x, be the fixed point of f(x) = Ax + t. Let 0(x) =x — xy. Then 67'(x’) =x’ + x,, and we have 
606 '(x')= AQ’ +x) +t—x, 
= Ax’+ Axp+t—x, 
= Ax’ +x — xf 
= Ax’ = f'(x’) 
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as desired. 


III.5.4 Using the coordinate change notation 0 : K > X we have 0(x) = 2x — 1, which yields 9~!(x’) = (x’ + 1)/2. We get 
the transformed /’ in the new space by 


f'(x’+) —Ao a 067! (x’) -X’ = Xx’ 


f 2 ! 2 
poor =((7 +t) -1) +1=(+ =] mv 


eee oe! 
eo fooa)=2((25*) +) —1= 5! - 141 


II1.5.5 We first find the fixed points of the MGébius transformation 


which we get from 


giving us, 


ie ee! 
— i = . = 
Multiplying through we get 
z+2=42—2° 
or 
z*—3z+2=0 


which factors into 
(z —1)(z —2) =0 


There are two fixed points, at z = 1 and z = 2. 
Make the coordinate transformation which takes 1 +> 0 and 2 +> oo. The transformation 


—1 
02) = —> 
will work. We need the inverse of this: 
z—1 
Sg 2 
(Z—2)w=z-1 


zw—-2w=z-1 
wz-z=2w-1 
2w — 1 


z=0 '(w) = 
w—l1 


We make the composition f’(w) = 00 f 067'(w) as follows: 


Pr eae aa 
4— wl 4w-4-2w+1 2w-3 


so that, 


3-1) 4w—-3-2w+3 2 


4u-3 2° 4w—-3-4w+6 3 


60f 00 '(w)= w 
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This map shrinks the sphere by a factor of 2/3. The whole process can be written by expanding the real line from 1 to 2 to 
the the whole positive real line, then shrinking the resultant Riemann sphere, then reducing the positive real line to the line 
between | and 2 again. The net result is shrinking the sphere in a non-uniform manner. 


TIT.5.6 Let W(x) = Ax +t which we express as usual. The fixed point of this transformation is 
_{%*1\)_{@ b x} é 
o=(3)-(C a) +) 


Xf, =AXy, + bxp +e xp =cxy, +dxzp,+ f 


which we rewrite as 


We solve this as a system of linear equations: 
O=(a—1)xs, + bx, +68 
O=cxy,, +(d—- I)x 4, + f 


which gives 
O=c(a — 1)xz, + bex;, + ce 
O=c(a —1)xz, + (a— 1d - I)xp, +e 
0=((a— 1) — 1) — bc)x,,+ (a—1)f —ce 
O = (det(A — /))x7,+ (@—1)f —ce 

yielding 


ce —(a—1)f 
‘fa Get(A — 1) 
A nearly identical derivation gives 
mee bf —(d—l)e 
ft det A =~ 1 
Notice that these equations do indeed become invalid if det(A — J) =0, when the “fixed point” tends to infinity. To 
transform coordinates so that x ¢ is the origin, choose the transformation 6(x) = x — x. The inverse of this, letting w = 6(x) 
is just x = w + xr. The whole coordinate transformation is then 
Oo f 06 '(w)=(A(w+xp) +t) — x, 
=Aw+Axs+t—xy 
= Aw + (Ax; +t) —xy 
=Aw+xs—XF 
= Aw 
So that the effect, in the transformed system is just the action of the linear transformation Ax. 
II1.5.7 By exercise III.2.13 we know that 


det D=A,A, = det A 
and 
trD=A,+A2,=trA 
Then by multiplying (A — 41)(A — A2) each A; is a solution to the equation 
A? — (Ar + A2)A + (A1A2) = 0 
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or 
1? —tr AA + det A =0 


Now 
— i 
det (¢~ ply) =©-WO-2)- fe 


=eh—ed—hi +d? — fg 
=)?-—(e+h)A+eh — fg 
=)?~—trAd+detA 


as desired. The above equation is called the eigenvalue equation for A, the numbers A, 2 are called eigenvalues, and have 
the property that there exist lines in R? such that for any v on those lines 


Av=Auv. 


The lines are called the eigenvectors of the matrix A. 


11.5.8 To make the transformation h(z) = t for analyzing the behavior at the point at infinity, we first note that h~'(w) = + 
has the same functional form. Composing, we have 


Gop 
e\+i 148 


The point at 0 is of interest, and is now a fixed point. Locally we may expand 


74 w ay + aw 
where, dp = 0 and a; is given by the Taylor expansion 
d w (7+w)—w 7 
a, = — | -— ] 0) = ————| == 
dw \7+w G+w) ii» 


so that near zero, this looks like f’(w) = (1/7)w which is a contraction. 
IiI.5.9 We have 
fulz)=x?- ws; g(x) =Ax(1 — x) 


and we want to reexpress f as fy,,). f is a parabola opening up, centered at the y-axis with vertex at —y, g is a parabola 
centered at x = 1/2 opening down, with zeroes at 0, 1. Consequently, we need to reflect f, and stretch it so that it’s zeroes 
are one unit apart, and shift it to the right by 1/2. First, the transformation 6(x) = —x will reflect the parabola as follows: 


60 f 067'(x) =00 f(—x) = —f(—x) 
which for any value of ju is 
— f(—x) = —((-x) — pw) =—x' +h 


a parabola pointing down with vertex 1 centered on the y-axis. The transformation which will move the vertex to x = 1/2 
is given by 0(x) =x — 1/2. Combining these yields 0(x) = —x — 1/2, or, 


90 f oO M(x) =—x? +x —-1/4+p—-1/2=-x° +x +4 (u- 3/4) 


We now need to adjust this so as to produce a multiplication by 4 and remove the constant term. We use the transformation 
6(x) =x/A. Once again composing, we now have 


Nie 


=< 
O(x) = <= 
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so that 
A 
6"\(x)=-Ax + = 
2 
We have 


d 
00 f,0 6"(x')=80 Fu(—Ax’ + 


2 
1 
= G05 x 42°)=— p) 
] ! 12 UL 1 
=—)(- — on > 
G x +x") eae 
re | 


5 + 7? 
This is identical to the expression for g,(x’) except for the constant term, we now find (A) by setting this term to zero and 
solving for j. 


Ae e4k pe 
a Das oS a 
? ree Bae 
= At? 
'-- 2 
—] —|] 
=—)’+—) 
His ge 
We have 
—1 
2 = —j\? 4 —} 
O(x) aS. uA) ee 
TII.5.10 We want the behavior of g(x) = x? — 1/2 near the fixed points. We have 
1 
EEO RS 
Xf=Xy ) 
x2—x = 
f J , 
1+/73 


xf = ) 
and we show the behavior around these points using two methods. 


Analysis by moving the fixed points to the origin: 
Let : 


ie ae Le 


5 ) 
l 3 
O-'(x)=x+x, =x +( es 
Then we have 


1 
goO (x) =x7 + (14 J3)x! + 1+2v3 +3 


1 
, 
143 
ox 4 n+ = 


g(x!) =0 0g 00 '(x') =x" + (14 V3)x’ 
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which has a fixed point at the origin. Very close to the origin, since x’? <x’, we have 
g(x’) © (1+ V3)x' 


which is an expansion. For the other fixed point, take 


O(x) =X —Xp_=xX- i. we 
0 "(e)=x +x, =x + - wmf 
which gives us 
gob (x!) =x” + (1 — V3)x' + eat a ; 
= x"(1— V3)x’ + = 


g'(x')=00g200 (x) =x + (1 — 73)x’ 
Again looking at behavior near the origin, we have 
g(x!) © (1 — V3)x' 
Since the coefficient is less than zero, there is a reflection, and since |1 — /3| < 1, it is composed with a contraction. 
Analysis by expanding g(x) in a Taylor series: 
The derivatives needed to expand g(x) in Taylor series up to the linear term, at the fixed points are 

g(x) =x? — : 

g(x) = 2x 
At the first fixed point, x, we have 


1 3 
ao = g)| 195 = ue 


a, = 2) a 1 3 


and the expansion to the linear term looks like 


1+¥3 3 1+3 | 1 3 
mal + (1+ V¥3)(x - By = 4+ VDE + — AnD 
which is an expansion. For the fixed point x;, we have 
1-3 
ag = g(x)| 1_-/i = 5 
> aa 


a= 8@)|_g =1— V3 


and the expansion to the linear term looks like 


i- 
2 


1-43 nt - J+ O— - 20/9 


which is a reflection and contraction as before. In general, it depends on the nature of the function which method is easier. 


+ (1— V3) - 
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If it can be transformed by inspection, transforming the fixed points to the origin is preferable. In general, the Taylor series 
expansion method is more straight forward and simpler. 


II1.5.11 In exercise III.5.7 we showed that the entries of the diagonal matrix D satisfy the eigenvalue equation 
A? — tr AA + det A=0 


Using the quadratic equation, the solutions to this equation are 


= trA+./(tr A)? —4det A 
= 


Ai 


and the condition stated in the problem is that the discriminant of this equation be greater than zero, in order to have two 
real solutions to the equation. If the discriminant is less than zero, then the solutions are complex conjugates of the form 
A, =x +iy =re*!®, The matrix defines a transformation which contains a rotation by 6 and therefore cannot be put into 
diagonal form, although it can be written as a rescaling and a rotation. 

Turning to the problem of finding the matrix B, we separate out for a moment the normalizing factor of 1/ det B for the 
inverse of B, which will turn out not to matter, and write 


det B- BAB! =det B-D= det B-( 3) 


0 dz 
Letting 
a b 
‘=( 
we have 
_,__ (aed—afc+bgd—bhc a?—aeb+bah — gb’ 
a! eS ( ced — fc? + gd*—dhc caf —ceb+dah—dgb 

which yields 


ade —afc+bdg —bch=detB-i, 
adh + afc — bdg — bce = det B - iz 
a’ f — b’g —abe + abh=0 
d’g—c’ f +cde—cdh=0 
The first two equations when subtracted or added, merely return to us the result of exercise III.2.13, that the trace and 
determinant are invariant under this kind of coordinate change. Since only these are dependent on the det B term in front, 
we can safely ignore it in what follows, which is why it was carried separately. Rewriting the last two equations as 
a’ f —ab(e —h) — b’g =0 
c? f —cd(e —h) —d’g =0 


we may solve either one, solving the last one for c, we have 


d(e—h)+/d’(e —h)*+4fd2g 


= 4 eat Ve= mae, 
-! 
_ 4 wAt Veh +4fe 


7 =e 
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Examining the discriminant, we have 
(e—h)* +4 fg =e? +h? —2eh+4fg=e7+h*+2fg—2detA 
=e? +h?+2detA+2fg—4detA=e? +h? +2eh —4detA 
= (e+h)* — 4det A = (tr A)” — 4det A 
which is the discriminant of the eigenvalue equation. Putting this in, we have 
d trA+,/(trA)? —4det A d 
C= = (——  _ — fh) = 0 — A) 
f 2 - 


or 


fifa iF 
WEN a ee 


Since det B + 0 the columns are independent, so we must choose a different 1; for each of the above equations. Only the 
ratios are important here, so we choose a = c = 1, and have 


pa Mee =) (1 a 
fy =a 1 


Ag—h 
In closing, we note that the eigenvectors of A are the solutions to the pair of equations 


c( 


)=d. 


ex; + fx2=Ax, 

gx; thxz=Ax2 
which appears to yield the two solutions 

: a, =X2 ; ee xX) = X2 
- " (—h) 

These are actually identical, since by the eigenvalue equation 

Mer Oo a £ 

f Aah 


To solve for the eigenvectors, simply let x; = 1 and use one of the corresponding values for x2. Hence the rows of the matrix 
B found above are precisely the eigenvectors of A. This is true in general, (for N x N matrices the eigenvalue equation will 
have N real roots if it is diagonalizable), and depending on the problem presents another way to find a useful transformation. 


111.5.12 To solve this problem, we make use of the methods developed in exercise III.5.11. We have 


room(s 3)(2)+() 


Solving for the fixed point, we may rewrite this as 


ee x 0 
iv - a e ee sg i 


so there is a necessary translation by (0, —1/2) to bring the fixed point to the origin. To diagonalize the matrix we have 


(13) 


so that tr A = 4 and det A = —1. This gives us the eigenvalue equation 
7—4,—-1=0 
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yielding 
Aj =24V5 
We may now write down immediately the final form of the transformation, we want B and the above translation to produce 


fi!) = (7 ie : 5) 


which is indeed a coordinate rescaling. A matrix for B is then given in exercise III.5.11 by 


or to write out the whole expression for the coordinate transformation (x), 


vom (i Fe)(2)*() 
DG ea 


I1I.5.13 Let f be a transformation f:X — X. We want to show that {/”}°2, is a semigroup, which requires showing that 
f" 0 f": X — X is in this set of transformations. But 


fo f(x) = f"(F"(x)) = f(x) 
so the set of iterates of f is indeed a semigroup. 
111.5.14 Let T: & — & bea left shift transformation on code space, i.e. 
om Xperts... 
T Gey KoX5X... 
Since each application of T shifts the sequence to the left and drops the initial term, we have 
T"(X) = Xn i%ng2%n43.-.- 


The fixed points of T>(x) = x4x5... must therefore obey 


XfiX for fy owe =X f,% f5X fe saat 
Listing out the first few equalities, we have 
Xi =%4 
X2 = X5 
X3 = X6 
X4 = X7 
which yields 
X) = X3n+1 
X2 = X3n+2 
X3 = X3n+3 


In other words, any fixed point of T?(x) must repeat the first three symbols indefinitely. Explicitly, the fixed points of T3(x) 
are 


000, 001, 010, 011, 100, 101, 110, 111. 
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111.5.15 We need to show that the Mébius transformations on R form a group under composition. To check closure, we let 


vin BEE yee EL 
The composition is then 
at) +b 
W 1 0 W2(x) = oD) vg 
aex +af +bgx +bh 
~ Cex +cf +dgx+dh 
_yfger+ bg)x + (af + bh) 
~ (ce +dg)x + (cf + dh) 


which has the proper form. We check 
(ae + bg)(cf + dh) — (ce + dg)(af + bh) 
=aecf +adeh+bcgf + bdgh — aecf — bceh — adgf — bdgh 
= (ad — bc)eh — (ad — bc)gf 
= (ad — bc)(eh — gf) #0 
so the Mobius transformations on R form a semigroup. They are invertible as follows: Let 


epee dak 
~ ex+d 


We have 

wex +wd=ax+b 

wex —ax =b—wd 

—dw+b = dw—b 
— ewe—a- —cw+a 
which has the proper form, with 
da — (—b)(—c) =ad — bc #0 
so there is an inverse for each element, and they form a group. 
III.5.16 This problem is identical in algebraic manipulations to exercise 5.12, with the replacement of a, b,c,d,z,w€ Cc 
instead of in R. 
111.5.17 Let f(x) = Ax +1, and g(x) = Bx + u be invertible affine transformations on R*. Then if w = f(x), we have 
w—t=Ax;A'(w—-tp=x=A'w-Alt=f '(w). 
Similarly, g~'(w) = B-'w — Bolu. 
We have 
fog(x) = A(Bx +u)+t=ABx + (Au +1) 
which is an affine transformation. We need to check whether it is invertible both because it needs to be in our set of 
transformations, and to prove group structure. Let 
w= ABx + (Au +f) 
w— (Au+t)=ABx 
A7!(w — (Au +t)) = Bx 
B'A'(w — (Au +t)) =x 
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Rewriting the left hand side, we have 
f'(w) = B'A7'w — B'A7!(Au +t) = B'A'w — (Bu + BAD), 
which is affine. The inverse matrices B~', A~' must exist because by assumption f, g are invertible. Hence the set of 


invertible affine transformations is a group under composition. 


I1I.5.18 Suppose f(A) C A. This implies f"(A) C A, as follows: It is certainly true for n = 1. Suppose for every 0 < 
i < k, the statement holds. Since by hypothesis f*~'(A) C A, and f(A) C A if AC A, we have f(f*"'(A)) C A. But 
fCf*"'(A)) = f*(A) so the claim is true by induction. To show that this implies that {f"} is a semigroup, recall that 
f"0 f”™ = f"*™ which is in the collection. Hence { f”} is a semigroup of transformations. 


III.5.19 Let w(x) = Ax +t where 
a 0 
an(3 2) 


and ac # 0. This transformation is invertible with x = A~'(w — ft), as long as A~ exists. This is the case since det A = 
ac # 0. To show that the set of transformations of this form are a group under composition, we need to show that if B, A are 
of this form then BA is as well, and, if A is of this form then A~ is as well. 


Let 
a 0 e 0 
A=(‘ °) and #5 .) 


e 0 a 0 ae 0 
pa=(‘ ae 2 ewe 3 


which is also of the proper form. We have 


Then 


ae -cg = (ac)(eg) #0 


1 ce 6 
At = — 
als of 


which is of the correct form. ca = ac, so the inverse is in the set. Hence the set of all such transformations w(x) on R? forms 
a group. 
I1I.5.20 Given f(z) = Az? + Bz + C, we want 0(z) =az + b such that 


Go fod '(zy=(Z’P+C 


so the transformation is in the set. Furthermore, 


Since 
1 
9-"(z') = —(z' — b) 
a 
we have 
A B 
fod (z) = ale. — 2bz' + b*) + —(z-b) +C€ 


A 2bA., Ab? Bb 
= 2" + (B- —)e¢ +(— - — +0) 
a a2 a2 a 


Composing 9(z) with this yields 


A bA Ab? 
60 f 00 "@=—2 + 8-20 + (—a — Bb+Ca+b) 
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We must have A/a = 1 and B — 2(bA/a) = 0 for the desired transformation, meaning a = A and b = B/2. Using these to 
simplify the constant term we have 
st 


B . 
O(z)=A — eet Ae ae mee 
(z) z+ 5 C=C +5 rl 


Answers to Chapter Ill, section 6 
111.6.1 
(a) sup(—oo, 3) = 3; inf(—oo, 3) = —oo. 
(b) sup(C) = 1; inf(C) = 0. 


(c) sup{1, 2,3, 4, ...} =+00; inf{1, 2,3, 4,...}=1. 
(d) sup(R*) = +00; inf(R*) =0. 


III.6.2 Let f : X — X be a contraction mapping on (X, d), and let so = inf{s € R:s is a contractivity factor for f}. Then 
So is a contractivity factor for f. 

Proof: Clearly 0 < s < 1 since if s is any contractivity factor for f, then 0 < s < 1 so that 0 < so <s < 1. As well, the 
infimum is taken over a non-empty set since there is at least some s which is a contractivity factor for f by definition. 
Consequently, we need only show that Sp satisfies 


d(f (x), f(y)) < 50-d(x, y) Vx, y EX. 
Suppose not. Then there is a pair x, y € X such that 
d( f(x), f(y)) > 50+ d(x, y). 
But this says that 
d(f (x), f(y)) — 50-d(@x, y) > 0 
and since 
d(f (x), f(y) — dQ, y) <0 
by the intermediate value theorem, there is an s’ with 1 > s’ > so such that 
d(f (x), f(y) —s’- d(x, y) =0 


Hence s’ is less than all valid contractivity factors, and there is an interval (so, s’) which contains no values which are valid 
contractivity factors for f. This is a contradiction of the assumption that so is the infimum of the contractivity factors for /. 


III.6.3 Let f :X — X and g: X — X be contractions with contractivity factors s,t respectively. Then the contractivity 


factor of f o g is st. 
Proof: For all x, y € X we have 


d(g(x), g(y)) <t-d(x, y) => s -d(g(x), g(y)) < st - d(x, y) 
But we also have 
d(f(g(x)), f(g(y))) <5 - d(g(x), g(y)) < st - d(x, y) 


which by definition makes st a contractivity factor for f(g(x)) = f o g(x) as desired. 
11.6.4 Let w(x) = Ax +t: R? — R? with fixed point x, and det(A — /) #0. Let h(x) =x — x7 =x’. Then h“!(x’) = 


x’ + xy. Represent 
_fa b\ fx € 
wora(e a)(a)+(y) 
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and 


ox *) 
xXxr= 
: & 
wore) =(4 A be Le “) 
C GI XA Sie £ 


be + axs, + bx, + bx5, ee 


cx, +cx,s, + dx, +dx,, + f 


ax, + bx, +xy, 
cx, +dx,+Xxz, 


Then we have 


because x; is a fixed point. Then 


howoh"(x')= * + bx, reg Sn (*) 


ez, +x, + X% tj, 
s & + 2) 
cx, + dx, 
= Ax’ =w’(x’) 
Inverting this we have 
howoh(x) =h"' ow'(a — xp) =h (A(x — x7) = A(X — xf) +44 


Finally, we check that A(x — x) +x = Ax +1. Since 
Xf; ae ax; + bx, +e 
X fo cxs, tdxy,+ f 


ax, + bx2 — (axyz, + bxz,) + (axyz, + bxy, +) 
chy ada (08, + ie) + (cx 2, + 5, + Ff) 


we have 


_ { ax, + bxz +e 
= cx, + dx, + f 


=(c a)la)+y) 


w(x) = w!(x) = Al (x =~ Sey ty 


To prove the next claim we have 


and 
w(x) = A(w(x) — Xp) +xy 
= A((A(x — x) +X) — Xp) +x¥ 
= A(A(x — x¢)) +x¢ 
= A*(x — xy) +x, 
Suppose that 


w"(x) = A(x — x¢) +25 
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Then we have 
w"t!(x) = A((A"(x — x4) +x) — xp) + 2x5 
= A(A"(x — x4)) + x 
=A™*(x — xp) +27 
so the result follows by induction on 7. 


We now describe the conditions on A under which the map is contractive for the Euclidean metric. We want d(w(x), 
w(y)) <s-d(x, y) for some 0 < s < 1. This means, 


d((A(x — x7) + X¢), (A(y — x7) + 5)) 5+ dX, y) 


or 
(ax, — ax, + bx2 — bxp, + x4, — ay; t+ axz, — byr + bxp, — ee 
+ (cx, — cxy, + dx2— dx +x, — cy +x, — dy2 + dx, — x4) 
= (ax; — ay, + bx2 — by2)? + (ex; — cy + dx2 — dyn)’ 
= (A(x — yi)? + (AQ — y)2)” 
Using the notation of exercise III.2.14 we need 


|A(x — y)| <s-|x-y| 


or 


A(x — 
AG @- <sVx,yexX 
|x — y| 


This is certainly true if we maximize over |x — y| so we have as a condition that |A| < s < 1 for w(x) to be contractive. The 
proof is identical with the Manhattan metric substituted and the appropriate changes made. 


III.6.6 Let f : R > R be given by f(x) = 1/2x + 1/2. Then f is a contraction mapping as follows: 


1 i 4 1 
d(f (x), f(y) = a(5x + 5°95” + >) 
1 


1 1 ] 
Te 5 (Ge eel 


so 3 
— 
Slat = 5)! 

1 
=o 
Sod 
= b 


so that s = ; is a contractivity factor. To find the fixed point, we write 


1 1 
so 3 
1 1 
—- 


or x¢ = 1. Rewriting in the form f(x) = A(x — Xf) + Xz, we have 
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f@)= 5-141 
f@=—0-I4l 


1 
lim f"(x) = lim —(x — 1) +1 
n—->0o n—oo 2” 


With x =0, f"(x) = 1 — 1/(2"). 
Writing the original, 
if 1 ae ee Se 
Tey ae Sea 
Qa 5G tots ee ae 
If 
ee gee OE 1 —1 1 
J GesU ae ae 
then 


Lil 1 1 
3 ee eas i 24 
f =5() ai t ets 


i=] 


: 1 1 1 
=y ae. 
~ £2 Qntl s, 
ntl 4 ] 
Sg Sait 


re | 


so the formula for f”(x) follows by induction. In the limit, 


independent of x. 


TiT.6.7 Let (X,d) be compact, and f : X — X be a contraction with contractivity factor s and fixed point x. Then 


F(X) CX, f(X) FX. 


Proof: (X, d) is compact so it is bounded. In particular, since the function g(x) = d(x,, x) is a continuous function from 
(X, d) to R, it has a maximum M, and a point xy € X such that d(x, x4) = M. Then for all x € X, 


d( f(x), f(xs)) =d(f (x), xs) 
< d(f (xm), xf) 
<s-M<M 


hence there is no x such that f(x) = xy, and f (X) # X. 


11.6.8 The set of contraction mappings on (X, d) are a semigroup under composition. To see this, we only need to know 
that if f, g are contractions then so is f o g. By problem 3, if the contractivity factors for f, g are s, t respectively, then so 


is f og with contractivity factor st. 
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III.6.9 Let w(x) = Ax + t where 


_ {5008120° —5 sin 120° 
~ \dsin120° —} cos 120° 
and t = ; . The easiest way to see that w takes the equilateral Sierpinski triangle to a subset of itself, and is a 


contraction is to break the transformation apart. The equilateral Sierpinski triangle is expressed by the following IFS: 


Each transformation shrinks the triangle towards one of its vertices by a factor of 2, the vertices are (0, 0), (1/2, /3/2), and 
(1, 0) respectively. Looking at w we may rewrite it as 


xy cos120° —sin120° \ / x; : 
Ww = = 

x2 sin120° cos120° x2 0 
+ 0)\ (cos120° —sin120°\ (x; Z ; 
0 3 sin120° cos 120° Xo 0 


| ee 8 x 1 
=(6 )@(Z)+(6) 


where Re is a rotation by 6 = 120°. For the whole attractor A, the map therefore rotates A by 120° about its left lower vertex 
(O), by an angle equal to the exterior angle of an equilateral triangle, depositing it as an equilateral Sierpinski triangle on the 
left side of the y-axis with vertices (—1, 0), (0, 0), and (—1/2, J3 /2). The next action is a contraction, since it is identical 
to w, and creates, setwise, a mirror image of w,(A) on the left side of the y-axis. Finally, there is a translation to the right 
by 1/2 bringing the image to a half-sized equilateral triangle with vertices (0, 0), (1/2, 0), and (1/4, ./3/4). In other words, 
it lies exactly (setwise) on the image w,(A). So the map does indeed contract the triangle, and take it to a subset of itself. 
The fixed point of w can be calculated by 


x, \ _ (3¢08120° —}sin120°\ (x, (12 
x2) ~~ \$sin120° —}.cos 120° x2 0 
fe Fi 
ae ¥3 Et | x2 0 
eee 


WwW 


which yields 
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ant qed 
ge. ing 
1 <7 
J3 
—_ 
10 
ay 
2/3 
are 


III.6.10 Let © be a code space on the two symbols {0, 1} and f(x) = lx = 1x,x2.... Then using the metric 


[Xi yi 
dé.= >. ou 


n=] 
we have 


Se Sy ne | 
d(f (x), f(y) = 22 aoe ag : 3 ; 
n=1 


so that f is a contraction mapping with contractivity factor 1/3 and fixed point 1. 
II1.6.11 Let (X,d) be a compact metric space, and f : X — X be a contraction mapping. Then {f”"(X} is a Cauchy 
sequence in (H((X), A) and lim,_,.. f”(X) = {x+} where x, is the fixed point of f(x). 

Proof: We first verify that { f (X)} is in fact a sequence in 7((X). Since f is a contraction mapping, f is continuous, and 
the continuous image of a compact set is compact. Hence f (X) is compact. By induction, f”(X) is compact. Furthermore, 
since x, € X and f"(x-) =x, € f”(X), f”(X) is non-empty. Hence, f"(X) € H(X) for all n, so this is indeed a sequence 
in H(X). We next verify that the sequence is Cauchy. If so, since X is compact, X is complete, and therefore so is 71((X). 
Hence if the sequence is Cauchy, it has a limit in #{((X). Let s be a contractivity factor for f(x), and let e > 0 be given. 
Since s < 1, we may choose WN such that for any n > N, s” < 5 where 6 is a number of choice less than 1. Let n,m be 
chosen n < m, andn, m > N. Since m > n, we have f”(X) C f”(X), so that 


h(f™(X), f"(X)) = max d(x, f”(X)) 
xe f"(X) 
But this must be smaller than the diameter of f”(X) defined by 
diam(f"(X)) = max_ d(x, y) 
x, ye f"(X) 
We have 
diam( f”(X)) < s” diam(X) 


since s is a contractivity factor for f. By choosing N such that s% diam(X) < €, we guarantee that 


Selected Answers 463 


h(f™(X), f"(X)) = = d(x, f™(X)) <s” diam(X) <e 


because all the distances considered for the maximum are less than diam(f”(X)). Hence the sequence is Cauchy and has a 
limit in 7((X). 
We now show that that limit is x ;. We know that x¢ € limp... f”(X). We need only show that 


Jim diam(f"(X)) =0 
But this limit is bounded by 
im diam(f”(X)) < Jim s” diam(X) = 0 
Hence, if y € limy-.0. f”(X) then d(y, x) = 0, but d is a metric, so this implies y = x. Hence 
lim f"(X) = (x7) 
as desired. 


I11.6.13 We are asked to show that if A = @ 4 is strictly positive definite (that is a, b, c, d > 0) then the transformation 


A foe, _ fa al 
sin 0 sin f (@) 

is a contraction mapping, where A is some positive number. Since we may scale A to fit the matrix A, we first check the case 
where det A = 0. This happens if the rows or columns of A are multiples of each other, and we have f(@) = 6; a constant, 
so in this case the function f converges on the first application. Otherwise, by rescaling A this will work for any strictly 
positive definite A so long as it works for a strictly positive definite A such that det A = 1. We need to have real eigenvalues, 
so that tr A =a +d > 2. We have ad — bc = 1 or bc = ad — 1 and bc > 0 so that ad > 1. Suppose ad = 1. Thend = 1/a 
and we have a discriminant 


f (@) defined on the first quadrant by 


1 
a’ +2ad+d’?-—4=a°+—-2 
a 


The expression 


has derivative 


q.. 1 2 
oF oe her Baca 
a + 3) a? 


showing zeroes when a* = 1 which since a > 0 occurs when a = 1. This is a minimum, and yields a discriminant of 0, but 
because the determinant is 1, this would yield bc = 0, which it can not be, so that we do indeed have such a A, and one 


choice is greater than zero. 
To show that f is a contraction mapping, we must show that 


d(f (1), f (@2)) < 5 -d(O;, ). 


The sine of an angle is an increasing function in the first quadrant, so writing 
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sin( f (6;) — f (@2)) = sin f(A) cos f (@,) — cos f(6;) sin f (2) 


1 
7 (ac cos 9; cos 0, + bc cos 6; sin 62 + ad sin 6, cos 62 + bd sin 6, sin 6;) 
1 
— 32 (ac cos 9; cos 62 + ad cos 0; sin 62 + bc sin 6; cos 6, + bd sin 6, sin 62) 


1 
“> (ad(sin 6, cos 62 — cos 6; sin 02) — bc(sin @; cos 82 — cos 6; sin 6) 


det A 

= = (sin 6; cos 6 — cos 6, sin 62) 
detA , 

= sin(6, — 02) 


so that, to get a contraction mapping, we need 


sin(f (61) — f(2)) _ det A _ 
SS see 


To test this, recall that assuming det A = 1, we have tr A > 2 and 


A  /(trA)?—4 
j= — 
2 2 


indicating A > 1. So f is a contraction, implying that { f"(@)} — 6, as desired. The fixed angle is in fact the angle of the 
eigenvector, yielding 


A- 
0; = arctan(———~) 


Answers to Chapter Ill, section 7 
III.7.1 We have w,(x) = 1/3x and w2(x) = 1/3x + 2/3. We compute the contractivity factor for the IFS. 


1 1 1 1 
d(w,(x), wi(y)) = I5* = 3)! = 3)" —yl= 34, y) 


1 2 1 2 1 1 
d(w2(x), w2(y)) = (3 + 3? _ (3y + 3 = 3h —y\|= 3a. y) 


52 => 


ro 

ee) 

We prove that the attractor of this IFS is the Cantor set by proving by induction that the action of W(B) is to remove the 
middle third of each interval in B. To this end, our hypothesis is: Let [a, b] be a component of B, with k < n. Then 


Ss = max(s\, 52) = max( 


[a,a+ (6 —a)|U[b-—- 306 —a),b) C Bea 
and 


1 l 
(a+ 7O—a),b = FO> a) OB i=0 
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We establish that this is true for the first case: Let Bp = [0, 1]. Then 
1 2 
B, = W(Bo) = w, (Bo) U w2(Bo) = [0, 3) U [3 1] 


Now suppose that [a, b] C B, is a component of B,. Then there is a component [a’, b’] C B,_; whose image is [a, b] under 
some w; (i = 1, 2). By hypothesis, the component [a’, b’] has had its middle third removed in B,. Then [a, b] is replaced in 
B,+1 by the set 


1 1 
wi(la' + (0 — a) UL! — 5 —a')) =[a + 5(6— a) ULb — 5(6—a)] 


since under w; the length b’ — a’ +> b — a and clearly the two points a’, b’ +> a, b respectively. Hence this operation is a 
removal of middle thirds, and the induction goes through. Then the attractor of the IFS is the classical Cantor set. 


TIL.7.2 We have w(x) = 5;x and w2(x) = (1 — 5;)x +.5;. Let Bo = [0, 1]. Then 
B, = W(Bo) 

= w;([0, 1]) U w2([0, 1) 

= [0, 5,] U[s},1 — 5; + 5;] 

= [0, 5;] U[s;, 1] = [0, 1] = Bo 
Hence By = [0, 1] is the attractor of the IFS. 
I11.7.3 Let w;(x) = 4x and w(x) = $x + 4. Let Bo = (0, 1]. Then 

B, = W(Bo) 

= w}([0, 1]) U w2((0, 1) 


Load 
=[0, s1U [5.1 


We have, at least on the first iteration, an operation which removes an interval of length t from an interval, with this removed 
interval being just to the left of the center of the interval. We test this by applying the induction hypothesis as in problem 1. 
Assume that for each k < n, any component interval in B, has a ; slice taken out to the left of the halfway mark, specifically, 
[a, b] goes to 


1 1 
[a,a+ 30 —a)|U[b—- 50 *@),D] 
and 
1 1 
(a+ 40 She 5 — a) Best = 
Then if [a, b] is a component of B,, then it is the image under one of the maps, say w; of some interval [a’, b'] C B,-;. In 
B,, this interval has been replaced by 
1 1 
[a’,a'+ hd —a’)]U[b' — 5b —a’), b' 
the image of which under w; is 


[a,a + 5(b—a)]U[b— 5(b—a), 


which says that that B,4+; is derived from B, by taking out the : slice. A is a Cantor set, with uncountably many points, and 
no intervals. To see this last, if [a, b] were a component interval of A, then since W(A) = A, this operation does not change 
[a, b] but the above proof says it must lose a slice, hence [a,b] CA>a=b. 

111.7.5 Let w;(x) = ax + b and w(x) = cx +d as in the statement of the problem. The attractor A of the IFS is compact, 
hence it is bounded, and acheives its bounds. Then there is a closed interval [a, b] C R such that a = min{x : x € A} and 
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b = max{x : x € A}. Suppose 
w1([a, b]) N w2([a, b]) #9 


Then since [a, b] is connected, and the w; are continuous, w;({a, b]) and w2([a, b]) are connected, their intersection is non- 
empty, hence w;({a, b]) U w2([a, b]) is connected. Furthermore, there are points a), b, € AC [a, b] whose image in one or 
the other map is a, b respectively. If there were any points in one of the images which were outside the interval [a, b], then 
the image of either a or b under one of the maps would lie outside of [a, b] and therefore not be in A which it is. Hence this 
image, w;({a, b]) U w2([a, b]) is mapped to itself exactly by W and is therefore the attractor A itself. Thus A is connected if 
w ({a, b]) N w2([a, b]) 4 B. Suppose w)([a, b]) N w2([a, b]) = 9. Then it follows since A C [a, b] that w,(A)N w2(A) = Q, 
as well. Let [a’, b’] be the longest connected component of A. Since w;(A) 1 w2(A) = G, there is ani € {1, 2} and an interval 
[a”, b”] (connected component of A) such that 


w;([a’, b")) = [a’, b’] 
Let s; < 1 be acontractivity factor for w;. Then 
d(a’, b') < s;d(a”", b") 


which says that either [2”, b”] is longer than [a’, b’] or the length of each is zero. The first contradicts the statement that 
[a’, b'] is the longest connected component of A, consequently a’ = b’, and the only connected components of A are points. 
Hence A is totally disconnected. 

III.7.6 No. The reason for this is as follows: Because each map must have a contractivity factor of less than one, each map 
must take the proposed attractor A, that is the union of two disjoint intervals, to a subset of either one interval or the other. 
The image of each map will be the union of two disjoint intervals itself. Consequently, let w, take A into the first interval, 
and w take A into the second. This situation must arise (possibly by renumbering the maps. It is possible to generate a 
single connected interval of either of these two images with w3, constructing one of the intervals of A. It is not possible to 
cover the gap in the other, since the image of w3 must lie entirely in one interval or the other. (Note that the attractor could 
be constructed from four maps). 

III.7.7 Let {R? : w;, w2} be an IFS with w; given by 


Let the attractor of the IFS be A. Then (x, y) € A implies that x = y, and x, y € [0, 1]. 
Proof: Let B = {(x, y) € R?: x =y, x, y € [0, 1]} Then 


(5) (843) 
as ee 1 
y tes 
! 
x =X 
y 27 


Since by supposition, x, y < 1, 1/2x,1/2y < 1/2 so adding 1/2 to each remains in (0, 1]. Furthermore, if x = y then 
1/2x = 1/2y and 1/2x + 1/2 =1/2y + 1/2 so the image of B is contained in B. The image of B under w; is {(x, y) € 
B:x,y €[1/2, 1]} exactly. This is because 0+> 1/2 in each coordinate, and 1 +> 1 in each coordinate. The image of B 
under w2 is {(x, y) € B: x, y € [0, 1/2]} exactly as well. Hence the image W(B) = B, and since the attractor is unique, 
B= A, 

III.7.8 We show that the attractor is the point {1}, together with the images w3(0). That these points are in the attractor of 
the IFS follows from the fact that the two fixed points 0, 1 must be in the attractor, and therefore all forward images of them 
must be as well. Let 


A={xER:x=1lorx=w>O); = 0; 20:00} 
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Let x € A. Then w;(x) = 0 = w9(0), if x = 1 then w2(x) = 1, and otherwise there is a k such that w3(0) = x so that w2(x) = 
wkt!(0). Hence W(A) C A. Furthermore if x = 0 then there is an element of A (for instance 0) such that x = w,(0), ifx = 1 
then x = w2(1), and otherwise if x = ws (O) then x = w (ws '(0). Consequently W takes A onto itself as well. Therefore 
W(A) =A. A is a countable sequence of increasing points by definition. We want the geometric series for x,. Writing 
w(x) = 2/3(x — 1) + 1 we have w3(x) = (2/3)"(x — 1) + 1. Substituting x = 0, we have x, = 1 — (2/3)”. To put this in 
the form of a geometric series, 
2 2 
=— ol — (y=) *~y\a-1 
Xn — Xn-1 Gy ete 
22 
- 1 3 ee SS 
( 3 )( 3) 
lL 
= (3) 


We may thus write the terms x, as the partial sums of the geometric series 


I< 2 
7 2G" 


Answers to Chapter Ill, section 8 
There are no answers to section 8, all the exercises are programming experiments. 


Answers to Chapter Ill, section 9 
I1I.9.2 We assume that wo(B) = C for any B € H(X). We have 


(5)=(°o° o2s)(3)+("o") 


To show that {R? : wo, w;} we need only establish the contractivity factor for w,. We may rewrite w, using multiplication 


y y 


d(w,(x), wi(y)) = V(.75x1 + .25 — .7S5y; — .25)* + (.75x2 — .75y2)? 
= V (.757((x1 — yr)? + (X2 — Y2)?)) 
.T5/ (%1 — yi)? + (2 — ya)? 


.15d(x, y) 


so the contractivity factor for w, is 0.75, the map is contractive and this is a hyperbolic IFS with condensation. Let Ap = C. 
We calculate w{(Ao) by using the A(x — x+) + xy form of the transformation. First the fixed point. 


xf, = 15x 4, + .25=> 25% f, a 25 => xf, = 1 
Xf, = .75Xp, > Xf, =0 


“(;)=orr(*5")+(0) 


Rewriting w/ as 
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we see that wi(Ao) is C shrunk by (0.75)” and moved to 1 — (0.75). In other words, as the figure shows, the trees get 
smaller, closer together, and closer to 1. By example 1, 


A=|)W"(Ao) 
n=0 
with 
A,;=CUw,(C) 
A, =C Uw (C Uw,(C)) =C Uw, (C) Uwi(C) 


Suppose that for k < n we have 


k 
Ar =|) wi(C) 
i=0 
then 
An+1 =CU w (A,) 
=CUw (Y vic) 
i=0 
n+l 
=C UL teacy 
i=l 


n+l 


=|(Juwi() 
i=0 


Counting the number of transformations of C for A, produces n + | trees as desired. Let C be the trunk of the tree. That is 
wWo(B) = C for any B € H(X). We define w;, w2 so as to form the two branches off the trunk, as follows: 


x\ _(rcos@ —rsin@ . 0 

*I\ yy) ~\rsin@ rcosé 1 

oiv yas @  rsin@ 4 0 

*\y)  \-rsin@ rcosé 1 
In other words, w, rotates the trunk by @, shrinks it by r and moves its base to (0, 1). w2 rotates the trunk by —@, shrinks it 
by r and moves its base to (0, 1). Successive applications of either of these transformations rotates the trunk by -£n6,shrinks 


it by r”, and moves the base to the top of the last piece. The figure becomes disconnected if the translation moves the piece 
above the top of the last trunk, it becomes not simply connected if r is close to 1 and @ is large. 


III.9.8 Without the ‘cls’ instruction we get effectively the union of W applied to the initial set. By example 1, this is the 
attractor of an IFS with condensation, the condensation set being the initial set chosen. 


Answers to Chapter Ill, section 10 


III.10.10 We make reference to the general form of the affine transformation expressed in terms of the fixed point derived 
in exercise III.6.4. That is, for a contractive affine map, we have 


f(@) = Ax. + f= AG — X;) +47 
F(x) = A] —2x;) + x; 
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which we regard as the canonical form of a contraction mapping. We have, in this case 
wj(zZ) = s; + (1 — $;)a; = 5;(z — aj) + a; 
Regarding multiplication by a complex number as an R? transformation, namely 
SiZ = Si,Z1 + 1S;, 22 + 1SinZ1 — SinZ2 
= (5;,21 — SinZ2) + 1(S;,22 + 5:21) 
re) 
Sin Si, 22 


this transformation is equivalent to the R? transformation 


“CeCe S)G=3)+@) 
y Sin Si, y — Gj, ai, 


from which we can read off the fixed point as a;. Note that s; is a rotation and scaling, making the whole operation a 
similitude. 


Answers to Chapter Ill, section 11 


III.11.3 In order that the attractor of an IFS be two lines joined at the origin, and variable with a parameter p, we may pick 
three maps as follows: Let w,(x, y) shrink the angle towards the origin with no rotation. This creates the angle again, with 
smaller clock hands. To get the hands back to the same length, create two maps which map the whole attractor to the tip of 
each hand, that is to a line at the same angle as each hand, with a fixed point at the end of the hand. Consequently, choose 


w, to be 
\ ee 
wie =(4 a 


which shrinks the clock face slightly. To collapse the attractor to a line of slope 0, we need 
te Ol ik\. { ax +by 
wey=(2 5 pare 


cx + dy 
ax + by 


such that 


tang V¥ x,y 
This is most easily accomplished by starting with the relations 
cid 
— =—=tand 
a b = 
for which one solution is the matrix 
A=(°°s 6 cosé 
~ \ sind sind 
This matrix has det A = 0, and generates the proper slope. Now suppose we want the length of the new line to be less than 


1. The clock hands can be any set of lengths, we will take the minute hand to be length 1, and the hour hand to be length 
1/2. Choosing the minute hand itself, we have 


A (088 \ _ cos’ @ + sin 8 cos 6 
sind) \ sin?6 +sin@ cos 
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so that the right hand side is of length less than two. Add to this the possibility of the whole other hand, which must be less 
than one, and we have to divide this matrix by 3 to assure that it fits within the given hand. So let 


Fo + cos ; cos 
ae + sin + sind 


Now we arrange to put the fixed point at the end of the hand. 


cos 6 cos 8 e€ 
eg “Al cae )* e 


or 
cos*@  sin@cosé@ 
e=cos@ — — 
3 
39 : 
6 6 cos @ 
PEE 7 _ sin 


We now let 6, = 2/2 — (2/6) p be the hour angle, which increases clockwise from 2/2 so that it reaches 27 when p = 12. 
We let 6,, = 1/2 — (2/2) p which travels the same direction but reaches 27 every hour. Having arranged that 


Wm(X, Y) = Alex (5) > (“") 


will be the map to the minute hand. The map to the hour hand is similar, but we must shorten the hand by a factor of 2. So, 


wate, = An(*) + (4) 


cos?@ sin, cos 6) 


where A; = 5A and 


e=cosG — 


6 6 
=) - 
sin’@ sin 8, cos, 
ey 6 _ - 
i g Sin 0), 6 6 


Answers to Chapter IV, section | 


IV.1.1 The addresses are a = 2121122, b = 333, and c = 12231222. the height, and half the width, and then, depending on 
the map, moves it to the appropriate corner of the square. To shrink the square by one half on each side, we define the map 


“(S)=( )G)=4G) 


and assign the symbol 0 to an iteration of this map. To get the quarter labelled 1, we must do the same shrink, and move the 
lower left corner over by 1/2 in the x direction, which gives the map 


o(3)=4(3)+(3) 


An iteration by w2 corresponds to an address symbol of 1. The third map must move the lower left corner to (1/2, 1/2), 


consequently we define 
1 
#(3)-4G)+@) 
y y 3 
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An iteration by w3 corresponds to the address symbol 2. Finally, the last map moves the lower left corner up by 1/2, giving 


(3-43) +) 


and an iteration by this map gets the address label 3. 


IV.1.4 The easiest way to see this problem is to look for a subset of the IFS maps which have as their attractor one of the 
sides of the Sierpinski triangle. Consider the IFS {R*; w,, w2} giving rise only to addresses which contain no 3’s. If a point 


is on the x-axis, neither of the two IFS 
ity 5 0\ (x 
Vy 0 t)\y 


(5) =(3 3)+(6) 


Either of these two will return a point on the x-axis to the x-axis, and their attractor is the line between 11 and 22. On the 
other hand, any point with a y component not equal to zero will on a single iteration have a new y value equal to half the 
original, which is still non-zero. Hence this edge of the Sierpinski triangle consists of all points whose addresses contain 


only 1 and 2. 
We may use identical arguments to verify that the points along the left edge of the triangle have addresses containing only 
1’s and 3’s. The final edge contains all addresses containing only 2’s and 3’s. 


IV.1.6 One scheme is to use the Cartesian product of the two codespaces. Let the IFS with condensation be {X; wo, w,..., 
wy} and the IFS whose attractor A is used for the condensation set {X; v1, v2,..., vw}. The attractor B for the IFS with 
condensation is then the union of the successive iterates 


co 
B=|Jw"(A) 
n=0 
Any point on B may be described by giving the sequence of iterates done on A to get to the particular image on B of A, 
combined with the address on the image, which corresponds to an address on the attractor A. Consequently, we have two 
code space strings, possibly on two code spaces, which give the whole address. Any address can occur. 


IV.1.8 This is part of the proof of Theorem 1 of section 4.5. Let {X; wi, ..., ww} be the IFS in question, let its contractivity 
be s, and let its attractor be A. Consider the images of A for two points which are near by in code space. That is let some 
small € > 0 be given, and let x = x;x2x3... and y = yi y2y3... be two code space points d(x, y) < €. The points in A having 
these addresses are contained in the images 
X € Wy, O Wx, O +++ Wy, (A) 
¥ E Wy, O Wy. O-+ + Wy_(A) 
Now, for € small, we may find a number 7 such that 
l 
<< E 
(N +1)" 
which means that x; = y; for all i < n. Then both points x, y are contained in 
Wx, ) Wx> asks Wx, (A) 


since the contractivity is s this image must be smaller than s” - diam(A). Hence the closer the addresses in code space, the 
closer the points having those addresses are on the attractor. 

IV.1.12 The scheme which is used in Figure IV.99 is the following: Write the numbers with the same denominators in 
horizontal lists beginning with the denominator 1, and putting each successive denominator under the list before. Starting at 
the upper right hand corner, go down if we have just arrived at the left edge, go right if we have just arrived at the top edge, 
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go up and to the right if we are leaving the left edge, or have just gone right and up, go down and to the left if we are leaving 
the top edge or have just gone down and left. This winds us through the rationals as shown below: 


l Z 3 4 5 

1 ee 2 ee | 

ee Lf Po e. 

1 2 3 4 5 

i, 2 2 2 2 
Lf fn * 

1 2 3 4 5 

3 3 3 3 3 

ae L 

4 


Fir 
Iw 
- SIS 
fin 


There are two things to note: The scheme works regardless of the entries as long as every rational is in some horizontal 
list, with the lists ordered in a countable manner, and, all redundant terms are not in lowest terms. Since any horizontal list 
becomes ‘shorter’ but still infinite if such terms are removed, we may use the same scheme if we agree to write down only 
those numbers in lowest terms in each horizontal list. To get the negative rationals, put the negative list corresponding to 
each positive one in below it. That is, the second row is numbers with denominator —1 the fourth with —2 and so forth. This 
method of converting the rationals to a linear list indexed by the counting numbers was invented by Cantor. 


IV.1.13 The triangles in the Sierpinski triangle can be labelled by size: There is 1 triangle of size 1, 3 of size ; and in 
general 3” triangles of size tr since on each iteration of W(A) we generate 3 times as many triangles of ; the size. Suppose 
we label each of the triangles with size ;; with the labels 


a +19 +2, .. eee 


Since this last term is 2 - 3" < 3"*', each is a unique label, there is a label for each triangle, and the collection of labels is a 
proper subset of the positive integers. The number of triangles is therefore countable. 


Answers to Chapier IV, section 2 
IV.2.1 The IFS {R; 5x, 5x + 4} is just touching. Let A be the attractor, and let O be the set 


] 1 
O= (0, DNAUC, IMA. 


Then we have 


) 


yu ] 
x2 


] 
w1(Q) = (0, 4 


i 3 3 
w2(Q) = (3 2 U (7 1) 


which are disjoint. By construction 


nw;(Q) O 


T=! 


Furthermore, © is open because it is the finite union of open intervals. The IFS is not totally disconnected because it has the 
attractor 


l 1 
A= (0, 1] = (0, 5] U [5 1] = w;(A) U w2(A) 


and w,(1) = w(0). 
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IV.2.2 The attractor of the IFS {IR; $x, 3x + 4} is [0, 1] as follows: 
1 


l 
wel, ESTs 
w,([0, 1]) U w2([0, 1]) = [0, 1] 


Since there is a unique attractor with the property that W(A) = A this is it. Furthermore, there is an open set (i, +) interior 
to the overlap between the images, that is 


LN ey ppA pares 
(75) C wil )N w( J=17> 5] 


so the IFS is overlapping. 


IV.2.3 The first thing to show is that the attractor of this IFS is (0, 1], as follows: For any affine transformation f(x) = 
ax + b on R, an interval transforms to either 


[lac+b,ad+b] ifa>0 


eis es ifa <0 


For each w,, here, a = + > 0, so we have, 


n—l nr 


w,([0, 1]) = seers To! 


Thus the union of these is 


10 1 9 
W(10, 1) = LJ wa(l0, 1) = (0, IU VI, N= 10, 


n=) 


Since the attractor A is unique, this is it. The IFS is clearly not totally disconnected because w, ({0, 1]) MN wn41((0, 1]) = {75}. 
However, let 


ze ox 
ee 


Then 
n—li nv 
wn(Q) C w,((0, 1)) = —p° 10? ae 
and for any i 4 j 
ih oe ie 6 
w;(O) 0 w;(O) C w;((0, 1)) Nwi(@, 1) = —. =) n ae = =o 


So the attractor is just-touching. 
IV.2.4 Our IFS is {[0, 1]; wi = 4x, w(x) = 4x + $}. The attractor A is a subset of [0, 1], and w,((0, 1]) = [0, +] and 
w2([0, 1}) = [§, 1] so that w, (0, 1]) N we([0, 1]) = 9 Then 


w (A) MN w2(A) C wi (0, 1]) N w2([0, 1) = 0 


so the IFS is totally disconnected. 


IV.2.6 We have w;(x) = 4 and w(x) = 5. The attractor is clearly A = {3}. The images of the two maps intersect, so the 
IFS is not totally disconnected. Furthermore, since the entire attractor is in the intersection there is clearly no open set O 
satisfying the conditions for a just-touching IFS. Consequently the IFS is overlapping, by default. 
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IV.2.8 The smallest number with no non-zero entries in its decimal expansion is .1. Any other choice, with non-zero entries, 
must have at least one digit which is greater than 1. Reducing this digit to 1 produces a smaller choice. Hence reducing all 
of them to 1 produces the smallest. 

IV.2.10 Let {X; w), w2,..., wy} be our IFS with attractor A. We first prove J] <=> I]. Let x be a periodic point. Then by 
definition there exists a finite sequence of transformations Wo,1+++, Wo, Such that 


Wo Qe° “9 We AX) = ¥ 


1 
If we define f(x) to be this transformation, we have 
f(x) = f(x) =X = (We, 9 +++ 0 Woy) 0+ ++ O (We, 0+ ++ O Woy) (X) 


consequently x has the address 0; ---6,, which is a periodic address. Now suppose that x has the periodic address 6; ---G,. 
Then x is given by 


OP(O, ++ - On) =X = P(O ++ + On) ++ - On) 
= Wy 0+ ++ O Won P(O] -* * Gn) = Wo, O +++ 0 Wo, (X) 


SO x is periodic. We next prove J <=> I 11]. The semigroup of transformations generated by {w, ... wy} includes all finite 
compositions of the maps for the IFS. In particular since w,, 0 --- © Wo, 1s a finite composition of these maps, it is an element 
of the semigroup. It’s fixed point is x. Now suppose that x is a fixed point of an element of the semigroup of transformations 
generated by these maps. Then by definition of the semigroup, there is an element of the form w,, 0---0 we, with the 
property that 


Wg, 0°*:O Wy, (xX) =x 


Consequently x is periodic. 

IV.2.11 We wish to show that any periodic point of the IFS {[0, 1]: 3x, 5x + 3} is of the form p/(2” — 1). This is certainly 
the case for the two fixed points 0, 1. Choose any point x in [0, 1]. Then we claim that after any sequence of k operations, 
we have 


et 
Wo, O°" © Way (x) = 3px + € 


1 


for some number C. This is true since each iteration multiplies x by 4 and multiplies any constant term by the same factor, 
and possibly adds another part to the constant term. If x is k periodic then we have 


1 Cc? 
ee gs or A oH 
Now C is composed by taking the constant term from the last iteration and multiplying it by ; and either adding a half or 
adding 0. Thus the binary expansion of C contains at each digit a 1 if £ was added on that iteration, and a 0 if it was not. 
Hence C has a binary expansion of length k, which, multiplied by 2° gives any possible number from 0 to 2‘ — 1, as desired. 
IV.2.14 The IFS {R; w,(x) = 0, w2(x) = $x + 71 is an IFS with condensation, the condensation set is {0} and consequently 
the attractor A is given by 


A= _) w3(0) U {1} 


n=0 
Any point p € A is periodic since there is some k for which p = w$(0) unless p = 1 which is a fixed point and therefore 
periodic. But this means that 
p = w3(wi(p)) = w3(p) o wi(p) 


so p is k + 1 periodic. a totally disconnected IFS is invertible, it follows that the cardinality of A is the same as that of 
its related code space. But code space on more than one symbol is uncountable by Theorem 1.1 in Chapter IV. Hence the 
attractor is too. 
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IV.2.16 Any uncountable attractor which is overlapping has uncountably many points with multiple addresses. This is 
because it contains an open set of points with multiple addresses which maps to an open set in code space by g@ |. Any 
open set with non-empty interior in code space on more than one symbol is uncountable. 


IV.2.17 One example of such an IFS is any overlapping IFS of two maps on the interval [0, 1] such that the end points of 
the overlapping region w,(A) M w2(A) are 3-periodic. This causes some of the sets of 4 symbols in any given position to 
map to the same point on the attractor, allowing uncountably many addresses for some points. 

IV.2.18 The set of interval end points in C is equal to exactly twice the number of intervals used in the derivation. Following 
the proof of the countable number of triangles in the Sierpinski triangle, (exercise 1.13 in Chapter IV), we have at each 
subdivision, 2” intervals added, and we number them 3” + 1,3" +2,...,3" +2" <3" ' Since this numbering scheme 
produces an integer in order for each interval without repeating, the intervals have been put into correspondence with 
the positive integers. Then their end points can too, since we may give all the left-hand end points the negative integer 
corresponding to the right hand end point labelled as above. Since the integers are countable so are the endpoints. 


Answers to Chapter IV, section 3 
IV.3.10 We first show that the point 1212 is a periodic point of period 2. 
f (1212) = 212 = 2121 
f?(1212) = f (2121) = 121 = 1212 


2 
so this is a periodic point of period two, and the other element of the cycle is 2121. Next we show that it is repulsive. Let 
x, = 1212, and y € Z. Then 


d(f7(xs), f?(y)) =a (xz, f°) 


= |xi — il xi — yil 
= 5 =3-3-) 
i=3 


i=3 
=9-d(x,-, y) — Bll — yl + 12 — yal) 


If we choose 5 > € >O then for y € B,.(x +) we have y; = 1 and y2 = 2, so for any such € 


d(x;, f7(y)) = 9-d(xz, y) 


and the point is repulsive, the cycle is therefore repulsive too. 


IV.3.11 The dynamical system {[0, 1]; 2x(1 — x)} has an attractive fixed point at x; = 5. f (0) =0 so that 0 is also a fixed 
point of the system. To examine the behavior near 0, we first notice that if x 40 then as x — 0 we have (1 — x) > 1. 
Consequently, as we approach 0 the function approaches f(x) > 2x. Since for any x > 0 we have d(2x, 0) = 2d(x, 0) > 
d(x, 0) it is easy to find an € such that the function is repulsive on an €-ball around 0. For example, let « = .1. Then 
d(0, f(x)) = 1.8d(0, x) for any x € B,(0). 


IV.3.14 We want the behavior of the dynamical system {R U {oo}; f(x) =Ax} in four cases: 


(i) A =0 gives us f(x) = 0 which makes 0 the only fixed point, and attractive. 

(ii) 0 < |A| < 1. 00 is a repulsive fixed point since | f(x)| < |x|, likewise, 0 is an attractive fixed point. 
(iii) 4 = —1. 0 is a neutral fixed point, as is oo. Every other point is period 2, and neutral. 

(iv) 1 <A <oo. Since | f(x)| > x for any x £0, 00, Oisa repulsive fixed point, and oo an attractive one. 


IV.3.17 We have 


x+1 ifx <0 


Re od ae if x > 0 


476 Selected Answers 


If x =O then f(x) =1 and f?(x) =0 so that {0, 1} is a cycle of period 2. For any other x split x into its integer part and 
fractional part, that is 


x=n+x Ixj|<l,neZ 


Then if x < 0, f"(x) =x which is between -1 and zero, and f"*+'(x) = 1 + X which is between 0 and 1. Applying f to this 
number gives f (1 + x) = —x, which is again greater than zero, and 


fP71+x) = f(-*%) =14+% 
which is a cycle of period 2. Consequently for any x < 0, x is eventually periodic of period 2. For x > 0, we have 
f@)=fi+x)=-(-1)-x 
so that redefining n := —(n — 1) and x := —Xx gives us the case just examined, and these points are eventually periodic of 


period 2 as well. 


IV.3.18 If {X; w;,..., wy} is a hyperbolic IFS, we may assume a contractivity s < 1. By Theorem 7.1 in Chapter III, the 
dynamical system {71(X); W} possesses a unique fixed point A € H(X). Also by Theorem 7.1, for any B, C € H(X) we 
have h(W(B), W(C)) > s - h(B, C). Consequently, 


d(W(A), W(B)) =d(A, W(B)) >s-d(A, B) 


and the fixed point A is globally attractive (globally attractive means € may be chosen any size). 


IV.3.20 We need a small lemma to proceed. If a space is compact, then for any € > 0, there is an €-net {y; ‘aa of points 
such that every point in the space is less than € from some element of the net. We need to produce a sequence of such nets. 


n(e1) m2) such that {y;} C {z;}. 


Proof: There is always some €2-net which has the €;-net as a subset, since given any €>-net {z;}p? , the finite collection 


of points 


Lemma: Given €, > €2 and an €,-net {y;};_, there is a minimal €2-net {z;};_ 


A (€>) 
{yi}? U (ej? 


is an €2-net containing the €;-net. Now taking the collection of all €2-nets containing this particular €)-net, there is an 
n(€2) > n(€,) which is the minimum size of any of the nets in the collection. Let {z; eae be one of these. 
Now suppose that our code space has the metric 


= [Xi — Yi 
d(x,yy=)>! =a 


f=] 
with some k > 1, perhaps k = N + 1. Let €; = zi , and, with the help of the above lemma, construct a nested sequence of 
€;-nets, 


n(€7) m3) 


n(é}) 
Oiler < Olu. © (5 a 
We build a sequence of points {x;}S2., by letting 
xj = Yi, see Vi j41 24 Pece 2541 Secs Yntej)4 eee Yntej) 5-41 


By construction, x; j visits each €; ball around each element of the €; net by repeated applications of the shift transformation 
T. But code space is compact, and therefore the sequence {x j}72 must have a convergent subsequence, which converges to 
a point x which visits every neighborhood in code space. aes x has a dense orbit under the shift transformation. 

IV.3.21 Let A, B be two open sets in code space. Choose an € > 0 and points o4 € A, and og € B, such that B- (04) CA. 
Then if the metric for D is 


[xi — 
so BE (N+ : 


Selected Answers 477 


there is an M such that for any o € & such that 
O71 = 04,, 92 = 9A, °*' OM = FAy 
we have o € B,(o0,4) C A. Then we only need to choose 
O =04,°**CAyOB 


which is in A, but T“(a) =o € B. Since by choosing the sequence of o’s which agree on M,M+1,M+2,... first 
symbols with o4, followed by og, as above, we get that for allm > M there is an element of A such that T”(o) € B, T is 
mixing on code space. 


Answers to Chapter IV, section 4 
All the problems are examples or figure exercises. 


Answers to Chapter IV, section 5 


IV.5.1 Let {X1; f;} and {X2; fo} be equivalent dynamical systems with the equivalence provided by the homeomorphism 
6: X, — Xo. Let {x1, X2,..., Xp} be a cycle of period p in X;. That is Ff? (xi) = xi, and 


ie oe 
Then 
frlO(xi)) =9 0 fi 087! 0 O(x;) = O(fi%)) 
and 
FP (xi) =O 0 fp0 87! 08 0+++0 fi 007! 0 O(x;) = O(x;) 


so that {6(x1), ...9(xp)} is a cycle of period p. To show that the statement holds in reverse, define the homeomorphism 
6, : X, + X; by 6, = 97", and repeat the above argument. 

The second statement, regarding attractive fixed points, is highly dependent on the restricted definition of the term given 
in the text, and, since it is not preserved by homeomorphism must be dependent on the metric. Choose two copies of the real 
line, (IR, d,) and (IR, d>) where d(x, y) is the usual metric, and d(x, y) is given by 


{ax — By tf x 0 
aen={i ifx >0 
The identity map is then a homeomorphism between these spaces. Now choose the function f(x) = — 5x for both systems. 


In (IR, d;), 0 is an attractive fixed point. For (R, d,), no matter what the size of the € ball is around 0, it will include points 
on the positive side that, because the ball is shrunken on the negative side (for example, B,(0) = (—1/3, 1)), fail to land 
back in the ball on one iteration. Hence this is not an attractive fixed point, even though the two systems are equivalent. 


IV.5.2 An eventually periodic orbit is has the orbit (x), X2,...,%n,Xnt1,---Xn+p = Xn+1}, for some n and some period p. 
As in exercise IV.5.1, we write 
\_ Jet fi fxnt+p 
fala =| ifi=n+p 
The argument is then identical to that in IV.5.1. 
IV.5.3 We modify the example in IV.5.1. Let (R, d;) and (IR, dz) be as in the solution to problem IV.5.1. Let the homeo- 
morphism 6 : R — R be 6(x) =x. Then if fi(x) = —2x, under our homeomorphism we have f(x) = —2x. However, in 


(IR, d;) 0 is a repulsive fixed point, but given any € > 0 a point x < 0 such that x € B.(0) moves under | iteration to a point 
which is closer to 0 under the d, metric than it started. 
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IV.5.4 6 is a homeomorphism, which means that 6 and 67 are continuous. This implies that 9~!(O) is open if O is open 
and (9~')~'(O) = 6(©) is open if O is open. Consequently, if there is an open neighborhood of x in X,, then 9(©) is open 
in X2 and 6(x+) € 6(Q) implies that there is a neighborhood C of 6(x,) in X2 such that C C 6(O). Suppose that for any 
y € O we have 


lim ff(y) =xy. 
Then since @ is continuous 
lim f7@()) = lim O(F2(9)) = Oe) 


for all 9(y) € 6(O), in particular for all elements of C. Then x2 ,= O(x;) is an attractive fixed point of f. Note that this is 
the standard definition of an attractive fixed point. We will use the more restrictive version given in the text, however. 

IV.5.5 The solution above to IV.5.4 shows that this definition of attractive fixed points is preserved under equivalence and 
conjugacy. (and in fact does not require that the spaces be metric spaces). All that is needed is to give a similar definition 
of a repulsive fixed point. For invertible maps, this is done by saying that x; is a repulsive fixed point if x; is an attractive 
fixed point of the map f~', that is, there is a neighborhood © of x; such that for every y € O, lim, f~"(y) = x;y. For 
non-invertible maps, this is extended by saying that there is a neighborhood O of x such that for every y € O there is ann 
such that f"(y) ¢ O. This is likewise preserved under equivalence and conjugacy, a moment’s reflection says that if this is 
true for O, then it is true for any neighborhood QO; C O. 

IV.5.7 We want to show that if 6, a homeomorphism establishing the equivalence of {R; f} and {R; g) is a diffeomorphism, 
then the definition of an attractive fixed point given in the text is preserved under the equivalence. Let x, be an attractive 
fixed point of {R; f}. The definition here establishes that a fixed point is attractive if and only if it is a contraction on some 
€-ball around x;. We let 0(x;) = x, be the corresponding fixed point of {R; g}, and we will show that g is also locally 
a contraction. Choose € > 0 such that f is a contraction on B,.(x;). Then for y € B,(x,), there is an s, O<s <1 such 
that | f(x) — f(y)| < s|x¢ — y|. Since 6’(x+) #0, there is an € small enough (the derivative is continuous by assumption) 
such that for any y € B.(x,), the following expression is well-defined (i.e. none of the fractions are equal to zero in the 
denominator) for y £ x: 


pone Xe--y | 
f(x) — FO) O(x-) — O(y) 


In the limit as y — x, the two fractions on the right approach, respectively, |6’(x )|| Te = 1. But since f is a contraction 


close enough to x ¢, we have 
[2p - IO 4 
which implies that 


bac atk os 
A(x) — 00) | 


Then for € > 0 close enough to 6(x ), we have 
OCF (x-)) — OCF (y))| S s|O(xp) — O(y)| 


so that |g(x,) — g(z)| < 5|x, — z| for z close to x,, and the fixed point is attractive because it is a local contraction. 


IV.5.8 Assume {R; f} is a dynamical system such that f is differentiable for all x € R. A web diagram for a given initial 
condition Xp starts at the point y = xo, and successive points are then taken from (Xn, Xn) +> (Xn, Xn41) > (Xn415 Xn41) Where 
Xn+1 = f (Xn), and the points of the form (x, X,41) are on the graph of the function f (x). It follows that if a point is fixed, 
Xn =Xn+1 = f (Xn) So that this is a point of intersection of the graph of f(x) with the line (x, x) or y = x. 

To prove the second assertion, we note that for our definition of attractive fixed points we have an € > 0 such that for any 
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y € B.(xy), we have a constant 0 < s < 1 such that | f(y) — f(xs)| < sly — x¢|. Then for y 4 x¢ we may write 
If) — Fs) _ | f£O) — Fy) 
ly — x,| y—XF 
Since this is true for all y € B.(x,), it is certainly true as y > x,y, yielding 


f(y) — fs) 
b ame, 3 


<s 


lim 


=|If'@p)l<s<! 
Y->X f 


To go the other way, if the derivative is continuous in a neighborhood of x, and | f’(x)| < 1, it follows that for some 6 > 0 
there is a neighborhood B;(x,) such that for any y € Bs(x,), | f’(y)| < 1. We may then find an 0 < € <6 and a number 
s < 1 such that | f’(y)| <-s for all y € B,. Then the map is a local contraction since 


FY) = FO0) 
ee 


lim 
y> YO 


<Ss 


implies that close enough to yo, we have 


If(y) — fF Qo)| < sly — yol 


in particular, | f(y) — f(x)| < sly — xsl, as desired. 

Finally, to generalize to cycles, let x1, x2,...,x%p be a cycle of period p, and suppose that the cycle is attractive. By 
definition, this implies that x; is an attractive fixed point of the dynamical system {R; f?}, for some x; in the cycle. By the 
result above, we have |(f?)'(x;)| < 1 for this point. This says that 


(f?)'@i)| = lf’ (xi) ge: Ff’ (xp) f'(x1) ete f'(xi-1)| <1 


Notice if this condition holds at x; it holds at any x; in the cycle since the expression is the same product. If this product is 
less than 1, it follows that for at least one of the elements in the product | f'(x;)| < 1, but it can also be the case that some of 
them do not satisfy this condition, only their product must. 


IV5.9 Since we would like our IFS to have an associated shift dynamical system equivalent to 
_ fi—2x xe[0, 5] 
ro=fo i xe(t, 1] 
we look at an IFS which is the inverse of f(x) on the regions in question. Consider the IFS 
1 ae | 1 
{[0, 1]; 5% va. 5 + 5} 


The attractor of this system is [0, 1] as follows: 
W({0, 1]) = 4.0, 11) + = U-= (10, 1) + = 
shee eae ee ee 2 
1 1 1 1 
= (0, sits U[=5,0] + 5 
=[z, 1]U[0, =] =10, 1 
=m 9 ? 9 9 se 9 
That this IFS is just-touching results from the intersection above containing the point 1 and the fact that O = (0, 4) U(5, 1) 


suffices for the open set. The points of multiple addresses are the images of this point. Defining the associated shift 
dynamical system, we have 


_fuw'@ ifaeG, 
we | w;'(a) ifae (0, 3) 
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The point of intersection yields 


Hod a 

(3) = 25 >) 0 
ee | 

Greeks 57-0 


so we define 
1 ae <4 
S(5) = wy (5) = w3 so 50 
and there is no conflict. This explicitly gives S(a) as 


-1 ae[$,1J 
sa) = |" —2a aé[0, 4] 


which is clearly equivalent to f (x) with the homeomorphism being the identity on [0, 1]. 
IV.5.10 When p = 0 we have 


© ai 


So ale 


If we let y = 0, then 


wi(x, 0) = ee 0) 


4 
>= : 0 
w(x, only Wear ) 
1 1 
w(x, 0) = (4% * i 0) 


so that for any point on the x-axis, es (x, y) is also on the x-axis. We now — the 1 vegas of the set [0, 1]. This maps 
to three subintervals, each of length 5 and arranged with their left endpoints at 0, 2 3 and } 4» respectively. The set formed by 
repeated iteration removes two siblcietyalz from each interval formed by the previous iteration, and the attractor must be 
lim,+oo W"([0, 1], 0). Thus for p = 0, the attractor is a Cantor set. 

For p = 1 we have 


§ 
Pi igare 4 
SZ & 
nN” 
i 
Fi WIG 
© ni~ 
i- © NI= 


1 
)G)G) 
2) \¥}\O 
which is a just-touching Sierpinski triangle. 
We next prove that for fixed o € XZ, the map ¢(p) — A(p) is continuous in p, for p € [0, 1]. We may write the three 
transformations in matrix notation as w;(x) = Ax + t;, where 
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so that we have 
Wo, O Wo, (x) = A*x + At,, + to, 
in general, we have 


Wo, O We, 9+ O Won (X) 


= A"x + A" tg, + A” fog—1 + +++ + Atos + toy 


oj 


so that 
: . 2. . 
= wae Atx + Z A'te:., = 2 A'to;41 
i= i=0 


Writing this out for fixed o, and letting 


=(%) 


we have 
o-(p) = ( Liao ee ) 
Sol) So 
which are continuous functions of p with o fixed (they are, for fixed o, differentiable in p, being a form of Taylor series 
expansion). 


That the set J(x) = {y € A(O) : 0(y) = x} where 6(x) = (1)(@) '(x) is the set of y whose paths meet at x is true by 
construction. 


Answers to Chapter IV, section 6 


IV.6.3 The three points in question have addresses 13 = 31, 12 = 21, and 23 = 32. Defining S(a) = WwW} ~!(a) forces the point 
a to become 12, which is eventually periodic since T (12) = 2, a fixed point. Similarly, if we define S(a) = w; > !(a) we force 
the point a to become 21 which since T(21) = = 1 is also eventually periodic. The points b, c are analyzed similarly, being 
forced to either of the two addresses, which after one iteration land on the fixed points 2 or 3. 

IV.6.4 If we create an IFS for the equilateral Sierpinski triangle in which we rotate the subtriangle corresponding to w2(4) 
by 240° and that corresponding to w3(A) by 120° we get a just-touching Sierpinski triangle in which the multiple address 
points correspond to equating the addresses 


13 = 33 31=2 12=722 


Since these points, regardless of which map is used in the equation S(a) = w; '(a) map to the points 3, 1, and 2, on one 
application of the map, S(a) may be chosen without conflict. Specifically, the IFS in question is 
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x + cos 240° —isin240°\ (x 1 
w2 a | ° 1 ° is ; 
y 5 sin 240 5 cos 240 y 0 
x $cos120° —4sin120°\ /x ; 
W3 sis i ° 1 ° J3 
y 5 sin 120 5 cos 120 y ~ 
IV.6.6 The IFS {2; ¢,,...ty} where t, = no has by definition an associated shift dynamical system defined by S(c) = 
t; (a) where o = 0)02---. Then S(o) = 0203 ---, which is exactly T (a). To see whether two such dynamical systems can 
be equivalent for different values of N, we use exercise IV.5.1, namely if two dynamical systems are equivalent, then a cycle 
of period p maps to a cycle of period p. Given N; > N>2, we have for either that any of the cycles of period 1 are given by 7 


where 0 <n < Nj. These are distinct, and if @ were the homeomorphism establishing the equivalence, would map to distinct 
fixed points. But this system for Xy, has N2 such points, which is less than N;. Thus 


02 Ny —> 2 N> 
either does not map fixed points to fixed points or is not one-to-one, in other words there is no such equivalence. 
IV.6.14 First we prove, as claimed in the text, that 
A ={($(0),0):0 € 5} 


is the attractor of the lifted IFS. To do this we need to confirm hyperbolicity, in order to assert the existence of a unique fixed 
point in H(X x LX). Let the metric on X be denoted by dy and that on & be denoted ds. To make things simple, we use a 
Manhattan metric on the product space, namely, 


d((X1, 01), (x2, 02)) = dx (x1, X2) + dz(O1, 2) 
We have, by definition, 
wi((x, 0)) = (w;(x), io) 


and we must show that this is a contraction. w; are the maps of the original IFS, and we may assume that that system has a 
contractivity of s. Then 


= lo, — 02.| 


1 
d(wWj (x1, 01), W;(X2, 02)) = dx (w; (x1), wi (x2)) + Ee d Neh 


1 
< sdy(x1, ——d;(o,, 
< sdy(X),X2) + Nod 5(01, 02) 


Let m = max(s, ;+;). Then 0 < m < 1 and 
d (W(X), 01), W;(X2, 02)) < md((x;, 01), (x2, 02)) 


so each map w, is contractive, and the IFS is hyperbolic. Consequently since X and © are complete, and therefore so is 
X x &, we have the attractor A is the unique fixed point in H(X x L) of the map 


N 
W(B) =|_Jd,(B) 
i=l 


for B a compact, non-empty subset of X x ©. We now verify that the attractor A is as given in the text. For any element of 
the set A = {(¢(c), 0): o € X} and some w; we have 


w;(A) = {(w;(@(a)), ia) :0 € X} 
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but io = d'(w;(¢(o))) so the image W(A) is in A. Let a € A. Then a is given by 
a = (Wo,(T(o)), 1T (7)) 


so a is in the image of an element of A under Wo,- Hence W(A) = A and this is the unique attractor of the lifted IFS. Finally, 
to complete Lemma 1, we have 


w (A) = {ae A:a=(x, lo)} 
w2(A) = {ae A:a=(x, 20)} 


These sets are points with mutually exclusive sets of second coordinates, so their intersection is empty. Hence A is totally 
disconnected. 
1V.6.15 Let {x,} be any orbit of the associated shift dynamical system {A; S}. Then by definition, S(x,) = %n41= Ww; l(xn) 
for some w;. Thus, regardless of the rule for assigning S(a) on overlapping regions, there is a rule of assignment which 
assigns either w7! or w3! to each application of S to the orbit. Let t be the address such that if S(x,) = w; ‘(x41 then 
t, =i. Let 

On = Trtn+1°°* 


Then $(0,) = Xn; On41 = T (on), so that the orbit {(@ (On), On)} of the shift dynamical system {A; S } has the desired property. 


IV.6.16 By definition, the lift is (01, 02)  (T.o1, 01,02) which looks like the shift automorphism: it takes the symbol off 
the first element and puts it on the second. The suitably defined inverse would then be to take a symbol off the second 
element and put it back on the first. This defines a system on & x &, as the space of shifts, which is identical to the shift 
automorphism. 


Answers to Chapter IV, section 7 
IV.7.2 x) = 212 consequently the true orbit of xp is given by 
%=212 T(%)=12 T(%)=2 
T"(q) = 26 2 forn > 2 
By contrast, we have x, = 12 after the first iteration, so that d(%,,, 7"(Xo)) = 1/3 after the second iteration, under the code 
space metric for two symbols. We wish to find a shadowing orbit. Taking 6 = 1/3 and s = 1/3 we need 
= s0 1 
d(Xn Xn) ST FF 


Any real orbit which has the symbol 2 in it will eventually be separated from 12 by at least 1/3 so the only possible choice 
for a shadowing orbit is 1. Computing the distance yields 


= —l|l= 


as 1 
d(12,1) ==) — 
san 


rAa= 


which satisfies the shadowing theorem. 


Answers to Chapter IV, section 8 
IV.8.1 It suffices to show that the positive rationals are dense in the positive reals, since the function #(x) = —x will show 
that the negative rationals are then dense in the negative reals. To this end let r € R, and let its decimal expansion be 
iror3.... Then the sequence of rationals 

r10, rir20, r1or30, eas 


converges to r. Hence every number r € R is a limit point of the rationals, and since the closure of a set contains its limit 
points, the closure of the rationals is the reals. Hence the rationals are dense in R. 
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IV.8.2. The sequence in question is an enumeration of the rationals in [0, 1], i.e. QM [0, 1]. Since Q is dense in R, Q1N [0, 1] 
is dense in [0, 1]. 

IV.8.3 The function f(x) = 2x mod 1 can be viewed as follows: On the circle of radius 1/27, that is, of circumference 
1, define x(@) to be the distance along the circumference in the counterclockwise direction from the positive x-axis. Then 
taking the usual equation of angles 9 = 0 + 27 = 6 mod 27, the function 6 +> 26 gives us our function on [0, 1]. If 6 is not 
a rational number times 27, it cannot be return to itself under this map, since 2”6 = 6 implies that 2"9 = 0 + mm for some 
integer m. Hence any irrational number times 27 for 0 results in a dense orbit on the circle. 


IV.8.4 The dynamical system {[0, 00); f(x) = 2x} is sensitive to initial conditions as follows: Choose 6 = 1, and for any 
€>0 let y #x, and y € B(x). Then since d( f(x), f(y)) =2d(x, y), we have d(f"(x), f"(y)) = 2"d(x, y). Choose n, 
given y such that 2"d(x, y) > 1. In particular, for any n > N where 2% > 1, d(f"(x), f"(y)) > 6. 

The dynamical system {[0, 00); f(x) = .5x} is not sensitive. In this case, we have d( f(x), f(y)) =0.5d(x, y) so that for 
any € > 0 and any 6, there is an N such that for any y € B.(x) d(f"(x), f"(y)) = (0.5)"d(x, y) is less than 6. 


IV.8.5 Let & be the code space on two symbols, and let T be the shift operator. Then {X; 7} is sensitive to initial conditions: 
Choose 5 < }. Then if x, y € E and x # y, there is an n such that x, # y,. Then since 


PG ade OSX Xnits 
T"(y) =T"(yiy2.-+) = YnYn+i ++: 
it follows that 


d(T"(x), T"(y)) > Fao > 8 


Hence whatever € is chosen, we may find a y and an n satisfying sensitivity to initial conditions. 
{X; T} is transitive as well: Let U, V Cc & be open sets. Then we can choose y € V and 6 > 0 such that B;(y) C V, and 
x €U ande > 0 such that B.(x) CU. Choose n such that zi < €. Then any v € & such that 


V = XjX2...X_VjV2...E B(x) CU 
Similarly, choose M such that for any m > M, 
U=Yi)2..-Vmuyu2... € Bs(y) CV 
Then any o € & of the form 
TF = Vy Vo~.. Ve X1X2 . - -X_102.-. 
isin V. T"(o) = X1XxX2...X,0102..., which is inU. Hence T"(V) NU FQ, as desired. 


IV.8.6 Suppose that {X; f} and {Y; g} are equivalent under the homeomorphism 6 : X — Y. Let U, V Cc Y be open sets. 
Then because @ is continuous, 6~'(/), 6~'(V) are open sets in X. Since {X; f} is transitive, there is an n such that 


O'"U)N f" 0d "(V) £B 
Let x be a member of this non-empty intersection. Then 0(x) € U and x € f" 0 6~'(V) implies 
600 'U)NG0 f" 0d "(V) =UN g"(V) £B 


so that {Y; g} is also transitive. Reversing the roles of 9 and 6~', which we may do because 97! is also continuous, yields 
the converse. 


Answers to Chapter V, section 1 
V.1.3 Let (X,d) be a metric space, A C X with A = {a,b,c}. We wish to calculate the fractal dimension of A. Let 
r = min, yca(d(x, y)). Then for any € < r/2, we have V(A, €) = 3. Hence the fractal dimension is 


In3 
D(A) = lim ——— =0. 
(A) ah REN SS 
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V.1.6 Using the Box Counting Theorem to cover the classical Cantor set C with boxes of size 1/3”, we will get 


=. fan) 


We need to calculate N,,(C). It takes 1 = 2° box when n = 0, subdividing and deleting the middle third gives 2 = 2' boxes 
when n = 1. On each subdivision and deletion of a set of middle thirds, we divide each interval covered by a box of size 
1/3” in thirds and retain two of these thirds, so that it will take 2”+' boxes of size 1/3”*' to cover the set at the next step. 
Hence N,,(C) = 2”, and we have 


In(2”) , nin2 In2 


D(C =lim,..~—— = —_ = —. 
ee ie Ce Oe ae 


V.1.7 Use boxes of side length 1/2”. For each quartering of the figure, and its subdivisions, we divide each box into 4 parts, 
and one part contains none of the fractal subset points. Consequently, on each subdivision, the number of boxes increases 
threefold. There is one box of side length 1/2° hence 


In3” = In3 
Di = lim —— = —. 
ae io 
V.1.8 One way to choose the boxes is to take side lengths 1/2”, with the first subdivision running horizontally from just 
below the bottom of the caption, the vertical running vertically from the lefthand ‘tip’ of the bottom piece. Do it with a ruler 
to assure yourself that this divides the figure into three congruent pieces, and leaves the fourth of the four squares empty. 


Following the lead of the previous problem, the dimension is a. 


V.1.9 We wish to show that using a Cantor set constructed by removing middle thirds on the interval [0, 3] has the same 
fractal dimension as the usual Cantor set. We start with 2° intervals of length 1/3°~'. The rest of the procedure is the same, 
except that we now have 

In(2”) nin2 we... n In2 


D(Big Cantor) = lim ——— = lim —————— = — oe ie 
(Big Cantor) noo In(3"—!) —. (n—1)In3 In3 pie EE In3 


V.1.10 Let A C R? be a compact non-empty subset. A has fractal dimension D, in the Euclidean metric, and Dy» in 
the Manhattan metric. The metrics are equivalent, consequently there is a transformation 6 : R? > R? which forms the 
equivalence. By Theorem 1.2 in Chapter V then, we have D,(A) = D2(8(A)) or D; = D>. 

if2 6 


V.1.11 Let B = ( 0 1/2 


) and define wo IFS 
+()2(0)46)40)0) 
(QOH 


These are two Sierpinski triangles, one with right angle at the origin, and hypotenuse between (1, 0) and (0, 1), the other 
with right angle at (1, 1) and hypotenuse between the origin and (0, 2). We rotate one into the other, expanding by a factor 
of two, to generate an equivalence between the two systems. This requires an invertible affine map. One which works is 


(5)=G' 2)G)+Q): 


The two attractors are metrically equivalent, and therefore have the same fractal dimension. 


with attractor A, and 
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Answers to Chapter V, section 2 
V.2.3 Let B be given by 


Then we can use the following IFS to generate @: 


w, = Bx 
0 
w= Bx+(,/)) 
1/2 
ws = Bx +( : ) 


Each map has a contractivity factor of 1/2, so the fractal dimension by Theorem 2.3 in Chapter V is given by 
1/2? + 1/2? + 1/2? + 1/2? =1 


This yields 1/2? = 1/4 or D = 2 as desired. 
V.2.4 We get the fractal dimension of the Cantor set from the IFS 
1 1 2 
(10, 1]; wir) = 3x; wa(x) = 5x + 5). 
Each map has contractivity 1/3 so we have 


1/3? + 1/3? = 1 or 1/3? = 1/2. 


Then —D1n3 = —In2 0r D= nm. to the equation 3(1/2?) + (2/5)? = 1. Solved numerically, this gives D © 1.874. 
V.2.5 The attractor contains 3 images of itself scaled by 1/2, with two of them rotated by 2/2 and shifted. Assuming that 
the attractor is centered at the origin, and extends two units in either x-direction and unit in either y direction, the IFS is 


given by 


where B is given by 


0 —-1/2 
So aes 
The scaling factors are all 1/2, leading to the equation 3/2? = 1, or D= ee 


V.2.8 This attractor has 6 maps, 3 at 1/2 scale, 3 at 1/4 scale. It is overlapping, we need an upper bound which we get from 
Theorem 3.1 in Chapter V. We have, 


3-(1/27) +3- (1/47) =1 
Rewriting this as 
3-(1/2)?? +3- (1/2)? -1=0 
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yields 1/22 = +(/21/6) — 1/2 or 


In(./21/6 — 1/2) 
In2 


V.2.9 Let w;: R’ > R’ for i = 1, 2, be an IFS with scaling factors s;, s2 respectively. 
That the new maps are similitudes follows from the general fact that the composition of similitudes is a similitude. To see 
this write the two similitudes as 


rR, Ro, ie) r2R2Ro, (x) 


Since reflections are diagonal, they commute with the other matrices, the scalars commute as well. The composition of 
rotations is a rotation, the composition of reflections is a reflection, multiplying scalars together is a scalar. We are left with 


r72R,R2Re, Ro, 


which is a similitude. 
The new contraction factors are 


oe SS Se ce 3 
Sy, = S]{Sv_ = 9182S vq = $2518, = 5} 


Using the original system in Theorem 3.1 in Chapter V, we have 
Is1|? + |s2|? = 1 
The new system yields 
Ist1? + 2Isisol? + Iszl? = (Isl? + Is2l?)” = 1 


The calculation of dimension remains unchanged, since the first equation is a solution of the second. 


Answers to Chapter V, section 3 
The exercises in Section 3 are all experimental in nature. 


Answers to Chapter V, section 4 


V.4.1 The fact that M(A, p, €) is a non-increasing function of € is proved by looking at the sums over which the infimum is 
taken. Suppose €; < €2 and that M(A, p, €,) < M(A, p, €2). Since any sequence {A;} which qualifies for €, is a sequence 
of sets with diameters less than €2, it follows that by taking the same sequence in the computation of M(A, p, €2), the 
infimum must be less than or equal to the sum generated by this sequence. But this is a contradiction. 

V.4.2 Let A = {a, b,c, d,e, f, g} be a subset of (IR?,Euclidean). Let 27 be the minimum distance between points in A. By 
choosing € <r, we must, for any € cover the set with at least 7 sets A;,...A7. We can also cover A exactly with 7 sets of 
any epsilon of this size. Consequently, the infimum is generated for any p by the limit of the finite sum as € — 0 which is 0 
for any p > 0. When p = 0 the sum is exactly 7. 

V.4.3 Let A be a bounded countably infinite subset of IR2, under the Euclidean metric. Let p : Zt > A be a rule of 
assignment which takes each positive integer i to an element a; € A, where p is 1-1 and onto. Such a function exists by 
definition of countability. Then the sequence of subsets A; = {a;} of A is in A, since their union is the set A. But the diameter 
of any singleton set in R? is zero, so that for any p > 0 


> Giam({a;}))? = 0 
i=] 
Hence for any p > 0, M(A, p, €) = 0 for any € > 0, consequently, M(A, p) = 0. On the other hand, if p = 0, then for any 


non-empty subset of A, (diam(A;))? = 1. For each n > 1, there is an € > O such that there are more than n disjoint subsets 
of A, hence M(A, 0) = c. 
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V.4.4 For p = 0, given eé < Hs there are at least 2” subsets in any sequence of subsets of diameter less than € which cover 
C. To see this, the endpoints of 2” disjoint intervals of diameter zi are elements of C, and can not lie in the same subset. 
Hence as € — 0, M(C, 0, €) — oo. Since M(C, 0) is the supremum of these numbers, it is also infinite. For p = 1, given 
€ > 0, we may choose the smallest n such that a < €, and cover C with exactly 2” subsets with diameter < €, by taking the 
intersection of C with subintervals of (0, 1]. The infimum for any € > 0 is the value of the sum as n — ov, which is 
git 

Lone SE 
Since M(C, p, €) = 0 for any € > 0 we have M(C, 1) = 0. 
V.4.5 The limit scheme forces a minimization of the number of sets by the diameter of sets. Suppose we attempt to find 
M(A, p) such that this number is less than infinity, but non-zero (this is not always possible). For the Sierpinski triangle, we 
may proceed by using a sequence of € values that are the diameters of the subtriangles. That this is sufficient is guaranteed 
by problem 1. For any p then, the values calculated for exactly the sequence of subtriangles leads to the infimum desired. 
We know that calculating on the subtriangles, we can minimally cover A with 3” triangles of diameter d -2~" where d 
is the diameter of A. Then we have, for each n, p a value of d - 3” - 27”? as a candidate for the infimum. Notice that as 
n —> oo for p small, this is an increasing sequence, consequently our infimum at fixed n, p is the value at n. For p large, the 
sequence converges to 0, so that we have one infimum for all € values. These p values correspond to infinite or zero values 
for M(A, p, €). The value at which we will get a finite nonzero number is when, independent of n, we have 2”? = 3”. This 
value is p = log, 3 = (In3)/(In 2). At this value, the terms in the sequence are identically d, the diameter of the Sierpinski 
triangle. 
V.4.6 As the following proof will indicate (exercise V.4.7), to get strict inequality one must have a situation where it takes 
more sets which are box shaped to cover A than if another, better shape is chosen. 


V.4.7 We have by definition, 


for the fractal dimension, where the diameter of each box is € times some fixed constant. Fixing € > 0 and rearranging this 
formula, we have 


D(A) In (=) = InN. 


| oni, 


¢ D(A) 2 


N 
ao re 
t=1 


Since for A; chosen such that A; = B; A, that is the intersection of each box with the set A, we have diam(A;) < B,, it 
follows that for this choice of €, M(A, D(A), €) < 00, since it must be less than or equal to this choice of A;. Since D(A) 
converges, this must then be the case for every choice of €, and we have that 


lim 2 tdiamiayy? a 


for p = D(A). If this sum is zero, then Dy(A) < D(A), if it is not, then Dy(A) = D(A). Notice that if M(A, D(A)) = 0, 
we cannot automatically conclude that these two are unequal, if it is non-zero then they are equal. 

V.4.10 Let Ai, Az € (R”, Euclidean), be two metrically equivalent sets, and let 6 : A; — A> be a metric equivalence. Then 
for any B C Aj, we have diam(@(B)) < C,diam(B) and diam(B) < C,diam(6(B)) for some constants C,, C>. Conse- 
quently, if (A, p) = 00, then for any choice of subsets, and « > 0, we must have >; (diam(B;))? — oo. Thus for 
any choice of subsets on A» this sum for the images of B; is equal to }-*° ,(C2diam(B;))? which is therefore also infinite. 
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Hence M(A2, p) = oo. Conversely, if for some p, we have M(Aj, p) = 0, then the infimum over ¢€ values for all choices 
of subsets of A; is zero. Consequently, for any 5 > 0, we may find a set of subsets satisfying diam(C;) < € for which 
M(A, p, €) <6 for every epsilon by choosing a sequence of subsets whose sum is less than 5/C, and using the metric 
equivalence to generate the sequence in A2. Consequently M(A2, p) = 0 as well. Obviously by using 0~', and reversing 
the role of the constants, we can do the same, given a value on A>. Since the two sets agree on infinite and zero values for 
M(A, p), then the point at which the two values change are the same. Hence Dy (A) = Dy(A2). 

V.4.11 Proving that a metric equivalence generates the same Hausdorff-Besicovitch dimension, is identical-to exercise 
V.4.10. The second statement, that the Hausdorff D-dimensional measure M(A, Dy(A)) is not preserved may be shown 
by example: Looking at example V.4.8, we may take a Sierpinski triangle with vertices at (0, 0), (2, 0), and (0, 2). Then for 
each step n in the example, we cover the triangle with 3” closed disks of diameter 2/2- 3H , and the example proceeds as in 


the text, except that 
M(A, Dy(A)) = (2V2)29/"? 
Consequently we may take the original Sierpinski triangle, with the metric d2(x, y) =2- d(x, y), or twice the Euclidean 


metric, and obtain a different value for the Hausdorff p-dimensional measure. 
V.4.12 They should be (inches)!3?!. 


Answers to Chapter VI, section | 
VI.1.2 The Taylor expansion for sin(x) near 0 is given by 
fo, ¢) 
(—1)*¥x rt) PS 
cgulae Bare ge. + 57 +... 
giving a linear approximation of £(x) =x. Let e > 0. Then a linear coordinate transformation from [0, €] x [0, €] to 
[0, 1] x [0, 1] is given by O0((x, y)) = (, ae With the change of coordinates we still get £’(x’) = x’, and the graph of this 
function, L is (x, x) € [0, 1] x [0, 1]. Transforming the function y = f(x) = sin(x), gives us 


y= PeteroyootGse 


so the graph G is the set (x, Me € [0, 1] x [0, 1). 
The Manhattan metric distance between a point g = (x, eee on G and a point / = (y, y) on L is given by 


sin(€x) 
d(g,l)=|x —y|+ ae Ge —y| 


The shape of an € ball in Manhattan metric is a square with vertical and horizontal corners. So to get the point on L closest 
to a given point g € G, we look for the intersection of the smallest isosceles right triangle with right angle at g intersecting 
L, which since the slope of L is 1, intersects from (x, x) to (2, **)) at the same Manhattan distance. Hence for any 
g € G the Manhattan distance to L is 


ts a= sin(€x) 


The derivative of we is decreasing, and this function is always less than x in this domain, hence the distance from G to L 
is given by the maximum, reached when x = 1, or 


ia D1 


Going the other way, we base an isosceles right triangle with the right angle at (y, y) and look for the smallest such triangle 
intersecting G. Once again, since the slope of G is decreasing, this will touch when we take the first coordinates equal, 


490 Selected Answers 


hence 


a(t, G)=y- 


and this is maximized at (1, 1). Hence 
He Gy S12 me 4 ah GF 
Noting that this is indeed decreasing as € — 0 since 


sin(€) = 


e—0 € 
we calculate 
h(L, G) = .01 > .99e = sin(e) 


or € less than approximately .1. 


Answers to Chapter VI, section 2 
VI.2.2 Let 


f(Qx)= Fit he 


Xj-1 


The first step is to show that if (x, y) € G then so is w,(x, Let (x, y) € G, that is (x, y) = (x, f(x)). Then (x, y) goes to 


or | 
perks 1 *N*i-1—x9x; 
XN—XO x IN —-XO 
(Et) x 4 INFin1—*0Fi 
XN —XO XN —%0 
To compute f(x) for the transformed x we do 
Xj — Xi-1 XNXji-1 — XOX 
x + —————_ - x; 
Xn — Xo XN — Xo 
Se. eo tet es XNXi-1 — X0Xji  Xi-14%N — Xi-1%0 
XN — XO Xn — Xo XN — Xo 
_ Xi Xi-1 Xj-1X0 — Xo%X; 
Xn — XO XN — XO 
Xi — Xi-1 
=i —4291 = 
XN — XO 
Dividing by x; — x;_; yields 
a Ag 
Xn — XO 


and multiplying by (F; — F;_-,) and adding F;_, yields 
(FP; — Fj = 0g) x F;_\Xy — F;-1X0 
XN — XO AN = XG 
(= - ==) Pix FX 
= ———————— 1X ot pea eee i ee ea 
Xn — Xo XN — Xo 


=cx + fj 
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hence 
N 
J wi(G) c G 
i=1 


To show equality it suffices to show that the endpoints of the graph map to the endpoints of the i" interval under w; hence 
showing that the map is onto. We substitute xo, xy for x above to get 


Xj — Xi-1 XNXj-1 — X0Xj 

Qty + eS hn tee 
Xn — Xo Xn — XO 
Xj — Xi-1 XNXi-1 — X0Xi 

Ajio +e ee RI 
XN — XO XN — XO 


so that the union of the w; is onto. Hence 
N 
|_Jwi(G) =G 
i=l 


as desired. 


VI.2.5 For this exercise we are asked to find an IFS whose attractor is a parabola passing through the points (0, 0), (1, 1), 
and (2,4). There is only one such parabola of the form y = Ax? + Bx + C, which we will find, and its equation is 


f (x) =x?. The IFS is of the form 
x Q\ 0 E e1 
w pe ~ 
(3) & a)(5) (4) 
x a, 0 x €2 
Ww — a 
GC) =(S a)G)+@) 
which we can insure passes through the interpolation points by using the equations for the various terms given in the text. 
To insure that this IFS has the parabola y = f (x) as its attractor, we substitute a point from this graph into the equation to 


*( io) *(aerare+s) 
7\ f(x) cox + dof (x) + fr 


(a;x + e;)* =cjx +d;x? + fi 


so that in both cases, 


We calculate 
gph Ose th ip O8 
asa €) = 7 = 0 
a=) =! a= P= 
Pg i. Seer ee ee PO cx ee | ae 
Cc) = 2-0 a17-0 = 2 2d, hi ee d, 4.) = 6 
— oes oa Se 
2= 5 be H=5-2k h=H-hy=! 
yielding 


1 1 
(5x) a (5 — 2d,)x + d,x? 


1 
(5% + 1)? =(1 —2d,)x +a>x’? +1 
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These are solved by equation of coefficients: 


l 
—-iod 
4 1 
l 
5 ~ 2d =0 
] 
ae 


and 


1 

a 

;7 

1= —- —2d 
5 2 


This leaves c,; = 0, and cz = 1. We have 


VI.2.10 By exercise II.2.11, the Euclidean and Manhattan metrics on R? are equivalent, so it suffices to show that the metric 
in Theorem 2.1 in Chapter VI is equivalent to the Manhattan metric on R*. We do this by looking at the three possible cases: 
If 9 = 1 the two metrics, d from Theorem 2.1 in Chapter VI and dy the Manhattan metric, are identical and therefore 
equivalent. 
If 6 > 1, then since 4|y; — y2| = ly: — yal we have d((x1, y1), (%2, y2)) = du (x1, ¥1), (2, ya2)), and since $|x; — x2| < 
|x; — x2] we have 


l : 
gate 1)» (X25 ¥2)) < du ((%1, Y1), (X02, Y2)) S A((H1, V1), (2, y2)) 


and the metrics are equivalent. 
Finally, if 9 < 1 then we have, by the same argument 


Odu((x1, 1), (X25 Y2)) < A((x1, Yi), (X2, Y2)) S du (1, yi), (2, Y2)) 


so they are equivalent in this case as well. 

VI.2.13 If we look at any IFS of the form {R?; w;(x, y)}, then if this IFS is hyperbolic, by definition w, is a contraction 
mapping, and the attractor of the IFS is a single point. But for N = 1, the curve we are attempting to interpolate has 
two interpolation points. Hence the attempt will always fail. Another way to look at it is that by exercise VI.2.5, these 
interpolations can always be arranged to be quadratic, a quadratic interpolation always requires at least 3 values. 


VI.2.14 By definition, we have 
(Tf )(x) = cnlz'(x) + dn f (C5 '(x)) + rn 


where £,,(x) = a,x + e,, on each subinterval [x,_1, x,]. We need to show that for f €¢ ¥, Tf is an interpolation function. It 
is shown in the text that Tf € F and hence does the proper job at the endpoints. What is necessary is to show that 7f takes 
each function f to a function which agrees with f at the internal interpolation points. By definition, 27' takes x,_1 to xo and 
X, to xy. Because f € F, f takes xo to Fo, and xy to Fy, and we have 


(Tf) (Xn-1) = CpXo + d, Fo + Th 
(Tf)(Xn) = CnXn + d,Fy + fr 
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into which we substitute the definitions of c,, d,, and f,, to get 


(Tf) Gnr-1) = (F,X0 — Fr-1X0 — dh Fuxo + d,Foxo 
XN — XO 
= quad + d,Foxn — d, Foxo + xnF,-1 — XoF, — d,Xn Fo + d,XoF y) 
= Fn-l 
(Tf)@.) = (F,Xn — Fa-iXn — dn F nxn + dy Foxy 
Xn — XO 
+ d, FnxXn =— d, FnXo + xnF,-1 = XoF,, — d,Xn Fo oe d,XoF ny) 
=F. 


as desired. Since 7 f interpolates the data for any f € F if f is a fixed point of the transformation, then by definition, the 
graph of f is the attractor of the prescribed IFS, and f is an interpolation function to begin with. 


VI.2.15 We wish to construct an interpolation using quadratic scaling, in which the w,(x, y) take the form 
Wa(x, Y) = ((anx + €n)s (Cnx + day + Bay? + fn) 


As before, we need to take the data pairs (xo, Fo) to (X,-1, Fn-1) and (xy, Fy) to (x,, F,). Since the transformation in x, is 
the same, we may define a, e, as before, namely 


Xn — Xn-1 Xn-1X¥N — XnXO 
a, = oo and PO ere 
XN — Xo Xn — Xo 


This will insure that the first coordinates of the data pairs are mapped properly. For the second coordinates, we must satisfy 
the equations 


F,-1 = CnXo + dn Fo + 8nFo + fn 
Fy = CnXn + dnFw + 8nFX + Sr 
Notice that c, and f,, must coincide with the linear case when g, is zero. This gives 


_ &o fee Fy — Fo 


n n SS: &nXc 
Xn — Xo XN — XO 
where we need to fill in for X,, and 
io InP y-1 — X6F p oe Xn Fo — XoFn = anX;y 
Xn — XO Xn — XO 


There are no quadratic terms in the equations to be satisfied, consequently, these must cancel at xo, xy, giving us 
BnF iy = BnXcX0 + BnX 5 
&nF x = 8nX Xn + BnX fF 

Dividing out g,,, and subtracting these yields 
Fy — Fo = (Gn — x0) Xe 


Gas 


Xn — Xo 


X¢ 


Putting this back into the first equation after dividing out g,, yields 
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F2 — F? 
X;= Fj - =i 
XN — XO 
Ds xn Fe — xoF3 _ F2x0 + F3x0 
Xn — XO 


xn Fe = xoF2 


XN — XO 
so that we have 

Bis By i> Ev te: + SR 
oe seer — oe 

Xn — Xo Xn — Xo Xn — XO 
f xv F; a — xoF,, * xn Fo — XoFy xn Fe — xoF? 
nn Se 

Xn — XO Xn — XO "Xn — Xo 


to complete the coefficients for the quadratic case. 
VI.2.16 Consider the integral 


xN 
i= / f (x)dx 
x0 


which here is well defined, since f(x) is continuous and of bounded variation. We assume that f is the fixed point of the 
fractal interpolation transformation, i.e. Tf = f, where (F, d) is the metric space of functions which match the endpoint 
data, and are continuous. Then we have 


XN N Xn 
t= [pods = > | rAwdx 
xO n=1 “*%n-1 


B 


l-—a 


N x 
29 one +d, f(x) + frld(Gnx + en) =al+ B= 
n=1 “%0 


where 
N 


N XN 
a= > Gntl and p= Lm [ (CpX + fnr)dx 


n=1 


We need to show that under the standard assumptions, |a| < 1, we have 


N N 
lo} =| > anda < > landa| 
n=] 


n=] 


and 
Zz a, = 1 and la,| < 1 
The x, are ordered, consequently a,xXo + €n < 4nXy + €, which implies a, > 0, hence |a,,d,| = a,|d,|, and therefore 


N 
lal < ) anldn| 
n= 


which implies |a| < 1. To show that 
XN 
| fo(x)dx = B 
x0 


we merely note that 8 is independent of d,,, and the result follows from above. 
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VI.2.17 We want the integral 


xN 
l= / xf (x)dx 


0 
under the same assumptions as in exercise VI1.2.16. Once again since Tf = f, we have 
XN N Xn 
n= fo x@p@dx= > [ xerpnas 
x0 = Xn 
so we transform variables. 


(Tf) (Qnx - en) =CpX + d, f (x) + Tn 


and we replace x with “4 to write the same integral as 


IN e= 
=f OY east die 
n=1 


n 


3N x 
= 3 i = (Ca + dy f(x) + fn)d(AnxX + €n) 
x0 


n=] 


ft =" (ax + d, f(x) + fn)d(anx + €n) 
n=1 “0 


Gp 


The first integral can be written as 


x 


XN -“ N d 
/ —(CnX + fn)d(AnX + €n) + / — xf (x)d(anx + é€,) 
xy n xn 


xN N 
= / (cnx? + fnx)dx + dn / xf (x)dx 
XO x0 
xN 
=| (c.x" + Snx)dx + d,l; 
The other half looks like 
IN IN 
En | (CnX + fn)dX + Ann / f (x)dx 
x0 x9 
which using the notation /p for the integral / in exercise VI.2.16 yields 


xN 
En / (Ca¥ + fn)dx + dnenlo 
x0 


We now let 
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r 
Ms: 


xN 
/ (CnX? + fnx)dx 


x0 


| 
il 
poo 


XN 
(CrX + frdadx 


I 
Me= 
— 


3 
Ul 
pod 


wm 
rT 
Cir 
S 


a 
i 


© 
I 
t= 
= 
2 


3 
u 


Then we have 
L=y+el, — (hl) + 4) 


_¥—Ght+6 

= = Pag 

_ y—8— @B/(1 —@) 
2 —: 


where we have used the notation of VI.2.16 for the integral Jo. We therefore must restrict the d, such that « # 1. Once again, 
y, 6 are independent of d,,, and we arrive at the integral J; = y — 4 in the case that d, = 0 for all n. 


Answers to Chapter VI, section 3 


The exercises in Section 3 are all experimental in nature. 


Answers to Chapter Vi, section 4 
VI.4.1 Define the metric on R x Y by 


d((x1, yi), (X2, Y2)) = [x1 — x2| + Ody(y1, y2) 
where a = max{q; :i =1,..., N}. 


dy(M,(a, y)s M, (0, y)) = cla es b| 
dy(M,(x, a), M,,(), y)) = sdy(a, b) 


and 0 = an c,s > 0, and under the usual assumptions about the ordering of x;, 1 > a > 0. We wish to show that under 
these assumptions, the IFS is hyperbolic with respect to the metric d. Under each map, (x, y) +> (Ln(x), Mn(x, y)) so that 


d(Wn(X1, Yi), Wa(%2, Yo)) = A((Ln(X1), Mn (x1, ¥1)), (Ln(X2), Mn(X2, y2))) 
= |Ln(%1) — Ln(x2)| + Ody(Mn (1, v1), Mn(x2, y2)) 
= |AnX1 — AnX2| + Ody(M,(%1, 1), Mn (2, y2)) 
= |an||x1 — x2| + Ody (Mari, yi), Mn (%2, y2)) 
< alx, — x2| + Ody(M, (41, v1), Mn (X22, y2)) 


The second half of this expression can be decomposed using the triangle inequality to read 
dy(M,.(x1, yi); M,(x2, y2)) 


< dy(M,(x1, yi), M,(%2, yi)) - dy(M,,(%2, yi), M,(%2, y2)) 
< c|x; — X2| + sdy(y1, y2) 
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Putting this together with the statements above, we have 


d(Wna(X1, 1), Wa(X2, Y2)) < a|x; — X2| + Oc|x; — x2| + sOdy(y1, y2) 
2a+l-—a 
ee 
e 2 
l+a 
See 
See 


Since 0 < a < 1, we have 1 +a <2 so let s’ = max(444, 5) < 1 then 


|x; — X2| + sO@dy(y;, y2) 


|x1 — x2| + s@dy(y1, y2) 


d(Wn(X1, Yi), Wa(X2, Y2)) < $8'|x1 — X2| + 8'Ody(y1, yr) = S’d((x1, y1), (X2, Y2)) 


so the IFS is hyperbolic as desired. 


VI.4.2 By Theorem 4.1 in Chapter VI, the attractor exists. We devise the operator T : F — F as in Theorem 2.2 in Chapter 
VI to be 


(Tf)(x) = Ma(f(Ly'(%))) x € Bent, Xn 


where F is the set of continuous functions f : [xo, xv] — Y. Ly, is a,x + e, which is continuous with continuous inverse, f 
is assumed continuous, and M,, is again linear, so that Tf is a continuous function on each of the [x,_;, x,]. We must assure 
that T takes F into itself. We have 


X1 — X0 XNXOQ — XOX} 
L (Xo) = Xo +e} = Xo + = Xo 
XN — Xo XN — XO 
XN — XN-1 XNXN-1 — XNXO 
Lyn(xn) = ANXN +e), >= XN EO CS x 
Xn — Xo Xn — XO 
Xn — Xn-1 XNXn-1 — XOXn-1 
Lie = QAnXn-1 + Cn — = Setoerrie cena TS | + Se ee Xn-1 
Xn — Xo Xn — Xo 
Xn — Xn-1 XNXn-1 — XnXoO 
LX.) GgXa + = i. ete, 
Xn — Xo Xn — XO 


so the endpoints of each segment conincide with the next and last, and x9, xy map to themselves. Hence these points are 
fixed under L, and therefore under L>'. By definition, 


(Tf) (Xo) = Mn (%o, Fo) = n-1, Fn-1) 
(Tf (xn) = Mian, Fu) = Gn, Fr) 
hence JT : ¥ — F as desired. We now show that 7 is a contraction mapping. Let f, g € #. Then we have 
Pie fai) 
(Tg)(x) = M,(L;", g(L;'(x))) 


With the given definition of a metric, and the restrictions imposed on M,, we have 


d(T f, Tg) = max Gr(Ma(L, (x), f(Lz'(x))), Mn(Lz (x), a(L;'(x)))) 


x€[Xp,—1> 


max Sdr(f (Ln (2), g(L;'(x))) 


ae <1 b oe Be 3 


<d(f, g) 


as desired. 


VI.4.8 We want to show we can generate polynomial graphs with affine hidden variable interpolation. Using the proof given 
in exercise VI.4.2, we have (7 f)(x) = M,(L, she ae ee '(x))). Consequently, the attractor of the IFS is given by 


f (x) = M,(L;"(x), f(L;'(@)) 
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where we let f(x) = (y(x), z(x)). We are interested only in fitting the first component, the second component may be 
chosen for ease of manipulation of the equations. We use the notation of the book, namely 


L(x) = nx + ep 


XxX 
= y Cak + Sn 
(>) =a (2)+ (S24) 


4 


From this we get 
i) dap Beed - Seen dose +: + 
z(x) "\ 2(L7'(x)) kalba) Bx 
from which we extract the equation for y(x) to get 


y(x) = dny(L7"(x)) + hnz(Ly' (x) + fa + OnLy (2) 


Newton proved that it requires N + 1 interpolation points to generate an N th degree polynomial, so that we assume that we 
have been given these points, and that they fit y(x), and likewise assume 


y(x) = ayx™ + ay_\xX! +--- +a 

Since L,, is linear, so is L>', so the degree of y(L; '(x)) is also N, and we let (for notational convenience) 
y(Lz'(x)) = Buy x” + By-1_x | +++ + Boy 
Then we have 
ayx™ +--+ + 0% = dny(Lz'(x)) + hnx(Ly (x) + fn + nL, (X) 

= dnBn,X™® + +++ dnBog + hnz(Lz'(x)) + fa + CnLz (x) 

which gives 
(ay — dnBn,)x™ + +++ (G0 — dnBon) = hnZ(Ly'(*)) + fa + CnLz (*). 


Now d,, as in the non-hidden variable case, is independent of the fitting of the points (x;, F;), hence on each subinterval, we 
may choose 


QN 
Then the left hand side is a polynomial of degree N — 1 so we may choose h,,, and let 
z(x) = yy-ix* | +--+. + 
in such a way so that if 
By—ipe | + +++ + 80, = Z(L7'(x)) 
then we have 
hndin = Hj — An Bin 
for i > 2 and 
Indy + Anb0q + Cab (X)) + fn = (Ot — AnBin)* + (0 — bnBon) 


by equation of coefficients. Thus we may generate a hidden variable fractal interpolation function for a polynomial of 
degree N. 
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Answers to Chapter VI, section 5 
VI.5.1 From Theorem 5.1 in Chapter VI, using three maps, we may choose 


7 oY : 3 
=(8 = )(2)+( 

0 Ci d, Z 

x ; 0 O 3 
(2)-(3 2 9)G)+(3) 

Z 0 « & Zz fs 


1 


For the Sierpinski triangle, the appropriate values are given by a; = d; = 5,5; =c; = 0, and e, = f; = 0, e2 = 7 fo = 0 and 


e3=0, fa=35.- 


Oo Owl- 
or & 
ss oO 
i 
ae i 
NS & 
Nene 
-+- 
RT aa 
oo min os pees & © 
eee wee 


Answers to Chapter VII, section | 
VII.1.1 Let {R?, f} be given by 
(2x,2y—1) if y>1/2, 
f(x, y)={ Qx-—1,2y) ifx>1/2, 
(2x, 2y) otherwise. 


If (x, y) > 0 and (x, y) ¢ S, then eventually y > 1/2 since y is doubled if x > 1/2 or if neither x, y > 1/2. Once this 
happens we have 


7S, y) = (2"x, 2”"y oe y eee 
Given y > 1/2, we have 
lim 2"y — 2"! = lim (2y — 1)2""' = 00 
n->0O n->0O 
and x — oo, as well. Hence these points escape to infinity. If y < 0, x > 0, then eventually 2”x > 1/2, if x < 1/2. Once this 
happens we have 


lim 2"x — 2"~! = lim (2x — 1)2""! = co 
n— co 


n->0O 
for any point whose x coordinate is larger than 1/2. The y coordinate remains negative, and 2” y —-> —oo. Hence these points 
also escape, in this case in the direction of (co, —0o). For points y < 0, x < 0, each coordinate remains negative indefinitely 
and these points diverge as 2”x —-> —oo and 2”y — —oo, so these points escape to (—00, —00). 
VII.1.3 The condition x? + y? > R determines the color based on the iteration number. This is passage past a circle of 
radius /R. Since if f(x, y) = (2x, 2y) we have f"(x, y) = (2"x, 2”y), and 
R 

(2"x)? + (2"y)? = 2"(x? + y2) > Rox? +y’?> ee 
we have concentric circles of radius LR 
VII.1.6 These two functions, 


w,(z) =rez—1 wy(z)=re®z4+1 


take a disk of radius R, to a disk of radius r R, rotate it by 9, and move its center to either +1. Consequently, if we can find 
a disk which is mapped into itself by both maps, we know that the attractor of the IFS is interior to the disk (see exercise 
VII.1.8). We thus look for R as a function of r such that this is the case, and restrict it such that its two images do not 
intersect. This will force the attractor of the IFS to be totally disconnected. 
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We make one further simplification: If the rotation were 0, we could examine the transformation on the real line, since 
these transformations take circles to circles, this will correctly predict the values of R. This reduces the problem to looking 
at w;(x) = —rx + 1. We look for a cross section of the ball [a, b] such that the images 


w;({a, b]) =[—rb—1,-ra—1] w»((a, bj) =[(—rb +1, —ra + 1] 


both lie within [a, b]. These two intervals are of identical length and orientation, differing only by a shift of two along the 
real line. Consequently, our conditions are 


a<-rb-—-1 b>-ra+l 
Solving at equality yields 
a=-rb—-1 
b=-ra+l 
a=r’a-r-1 


(l—-r?)a=-r-1 


Symmetry now dictates that b = ~ , SO that a disk of radius R = 1/(1 — r) maps inside itself under each map of the original 
IFS. We now want the condition on r such that these images of the disk Br(©) do not intersect. This happens when a disk 
of radius rR, centered about 1 does not intersect the y-axis, namely that 


. <1 
l-—r 


r<l-—-r 
Er <a 
pel2 


Consequently, for the original maps, as long as r < 1/2, the maps 
w,(A) M w2(A) C wi(Br(Q) N w2(Br(O) = 


and the IFS is totally disconnected. We must now define f(z) such that on the attractor, we do this with three maps (letting 
l= ZI + iZ2), 

Ww if + on es 

f(z) = 4 2 ifa2 iS 

re°z+X A= atin otherwise 
VII.1.7 This is essentially the general case of the preceding problem. Let {A; f} be the associated shift dynamical system 
of a totally disconnected hyperbolic IFS on R?, where A is the attractor of the IFS. Then we must have 

f@=vu;'@ — foraew;(A). 
Since the IFS is totally disconnected, we have 
w;(A)N w,;(A) = Hig j 


Furthermore, since the IFS is hyperbolic, w;(A) is compact for each i. Consequently, in R? this means d(w;(A), w;(A)) > 0 
if i # j. Then we may extend {A; f} using any function g(x) which smoothly connects the maps w;' in the intervening 
distance. For example, since there are finitely many maps, we may define 


e= min CURL A), wj(A)) > 0. 
ifj 
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Now construct functions which are equal to w; ! on w;(A), and decrease to zero at a distance less than €/2 from these sets, 
by taking functions 


w(x) = = se wel (x) if d(x, w;(A)) <1 
0 otherwise 


with k > 2. Finally, sum these together to get 


N 
B(x) =) t;(x) 


i=] 


Because the distance « was chosen to be less than the distance between images of the attractor under w;, these functions are 
continuous, agree with f on the attractor, and since k can be taken at any value greater than 2, there are infinitely many of 
them. 


VII.1.9 We get the iteration formula by generating the real and imaginary parts of f(z) =z —A. Letting z= x +iy, we 
have 
2=(x tiy =x? + 3x7iy + 3xi*y? + Py? 
= x3+i3x7y — 3xy* —iy? 
= (x° — 3xy”) +i(3x?y — y’) 
so that the new formulas, letting 4 = A; + iA2 are 
newx = x°3 - 3*x*y"2 - lambdal 
newy = 3*x°2*y - y~3 - lambda2 
The choice of R can be derived as follows: Writing z = re’®, we have z? = re, which has a distance from the origin of r? 


compared to the distance r of z. Subtracting 4 will have maximal effect on this distance when z° and A lie on the same ray 
from the origin, namely when A = pe'* and ¢ = 36. Consequently the problem reduces to this maximal case, namely that 


re—|aAl>r 
or 
r>—r—|al>0 
The minimum value for |A| is zero, and the inequality acheives equality for this A value when r = 1 + |A|. Choosing this as 
a candidate value for R, we find that 
+ ap?—-G +a) — Al =a? +3)a? +3) + 1-1 -2)A 
= |A|? + 3|A|? + [Aj > 0 
so that for any A € C, choosing R > |A| + 1 will work in the escape time algorithm. 


Answers to Chapter VII, section 2 


VII.2.1 In this case X = C, we must show the presence of sets such that W(A) = A, where W(A) = w,(A) U W2(A). 
Choose A = {QO}, then w;(A) = w2(A) = O. Likewise, choose A = {oo}, then w;(A) = w2(A) = {oo}. Consequently, the 
unrestricted IFS in this case does not contain a unique invariant set. In point of fact, neither of these sets above are attractors 
of the IFS. The system is not contractive around these points, which are repellors. The point z = 1 is also a fixed point of 
the system, it is a repellor, and is part of J(f). 


VII.2.2 Let A € (3/4, 5/4) and f(z) =z? — A. We want to look at solutions to 
2+z+(1—-A)=0. 
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Using the quadratic equation we obtain 


~t+ Ji —444) 2 —-1+/4, —3 
2 SS 2 


We let 
—1+/4A -—3 
1=-=e_——_—- 
Z 
—1—/§/4, —3 
es ee 


The easiest way to examine cycles of period 2 here is to check to see whether f(z,) = z2 and f(z2) = z, (different ap- 
proaches are used for different problems, this avoids quardic equations here). We have 


De ca ay een he eta EN ee 2} ioe 

ee , a eee (A—>5) 2 

, ti wait ta lM -3 
pee i y 


Subtracting 4 from each of these yields the desired values f(z;) = Z2, and f (22) = z;, hence this is a cycle of period 2. 
To check whether the cycle is attractive, we have 


f?' (2) = 423 — 4az 

We can simplify this for each z; by making use of the period 2 properties: 

Zp = (22 +A)Zy = U2. FAA 

23 = (2) +A)Z2 = 212 FAD 
giving us 

f? (a) = f? (22) = 4zizz 

and 
1—(4.-—3) 4-4, 


=—— =1-A 
4 4 


For A € (3/4, 5/4) we have (1 — A) € (—1/4, 1/4) so that 


42122 =4 


IP @I< ; ai 


Hence this is an attractive cycle of period 2. 

VII.2.3 Let A be specified and suppose a - i € J,. Then —a? € J, and a‘ € J, are real line points. The set J, is connected, 
therefore there is an extreme point of this form which maps to the left most extremity of J,, on both sides of the real axis. 
The pair of IFS cut J, at the negative extremity and map this image to the left and right side, depending on the definition of 
the square root in the two IFS equations. 

VII.2.5 For the dynamical system {C; f(z) =z? —A:A > 2}, we wish to show that the orbit of the critical point {O} 
converges to the Point at Infinity. We know that the basin of attraction for co contains at least the points for which 
r>1/2+./1/4 +4 so that if 1 > 2, we have rr > 1/2 +3/2 =2. f2(O) = f(—A) which is 2 when A = 2 and increases for 
increasing A. Hence lim,-... f"(O) = oo. By Theorem 2.1 in Chapter VII, the IFS {X = C \ Boo, €); +VZ +A, —Vz +A} 
possesses a unique attractor. 
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Answers to Chapter VII, section 3 


VII.3.1 The Newton’s transformation associated with F(z) is given by 


F(z) z+ —1 
HQ=f- —— = f= 
f (2) Fp Ses 
To see that 1 is indeed fixed, we compute 
fd) =1-0=1 
Taking the derivative with respect to z evaluated at 1, we have 
(42°)? — 12z7(z* — 1) 1626 — 1226 + 122? 
f'(2) =1 — ——— = 1 - 
(423)? 162° 
in ee 
icf = 4 At 


evaluated at 1 we have f’(z) = 3/4 — 3/4 =0 so that 1 is an attractive (sometimes called super-attractive because f’(z) = 0) 
fixed point of the Newton’s transformation. 


VII.3.2 The Newton’s transformation associated with F(z) =z” + 1 is 
+1 1 


so that the corresponding IFS is got by taking inverses of f(z) to yield 
1 
w = 1/2(z —- > 
2wz=z°-1 


z7—2wz—1=0 


x we aw eS 
SES, ES 


=wtvJw?+4+1 
Consequently, the appropriate IFS is 
wi(z)=zt+V274+1 and wW2(z)=z— z7+1 


For any z € R, z7 + 1 > 0, hence w;(z) € R. For z = 00, we have w;(z) — 00, and w2(z) — 0, consequently R U {00} > 
R U {oo}, and the map is onto, by virtue of the behavior of w2(z) for large z. Then if A = R U {00}, W(A) = A. We must 
make sure we can exclude the fixed points of the transformation. These are given by 


(2 < -) =2Z 
or zr = ti. The derivative of f(z) at these points is 1/2(1 + 1/(z*)) =0 so these are attractive. Hence removal of a small 
€-ball around +i results in a space X C C on which A is a unique invariant set. Therefore A is the attractor of the IFS. 
VII.3.5 The Newton’s transformation associated with F(z) = e* — 1 is given by 


e — 1 


ez 


f(z) =z- =(z—1)+e” 


It has two fixed points in C, for we have 


z=(z-—l1)+e* > et =1>72=0 
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indicating that 0 is a fixed point. We also have 
lim (z — 1) +e *=00 
z—>00 
so that oo is also fixed. The derivative of f(z) is given by 
f'@=1l-e* 


which is zero at z = 0, and approaches | at infinity. Hence the attractive fixed point is 0, oo is a neutral fixed point. 


VII.3.7 We wish to find a Mobius transformation h(z) which establishes a conjugacy between f(z) and g(z). This means 
finding A(z) such that 


f oh(z) =ho g(z) 
where 
f@=1e-1) s@=2 h@= et 


We mention at the outset that such a transformation is indeterminate with respect to scaling factors, so there will be choices 
among a, b, c, and d. Writing out the two terms of the conjugacy we have 


az>+b 
ee ds cz*+d 
and 
1 faz+b cz+d\ 1 ((az+b)*?—-(cz+d)’ 
fone} (ESSE) od (eae 
2\cz+d azt+b z (cz + d)(az + b) 
a4 (= + 2abz + b* — c*z* — 2cd — “) 
a2 acz* + bez + adz + bd 
a (< — c?)z? + 2(ab — cd)z + (b? — d?) ) 
a acz? + (ad + bc)z + bd 
so that 


2az*+2b — (a* —c*)z* + 2(ab — cd)z + (b’ — d’) 


cz+d acz? + (ad + bc)z + bd 
We may eliminate the linear terms in z by setting 
ad+bc=0 ab-—cd=0 


which give us 


bc b*c 
=—-— > —-— ~-cd=03d=-b* 
a nd 7 Cc => 
b 2b 
d=-— sab+— =050'=-¢ 
a 


—* 
With these equalities assumed, we have 


2az*+2b (a? —c*)z* + (6? +”) 
c2+d acz* + bd 
Since, as mentioned, we have choices of scale, we set a = 1. This gives us 
22742 (1—c?)z?+ (6? +d?) 
czz+d cz? + bd 
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and comparing denominators leads to b = 1. Then we have 
2=1-c? and 2=1-d’ 
Cust end d= 


and we again have a choice (this time the sign of the scale factor). We choose c =i. Then ab — cd = 0 or 1 — id = 0, and 
d = —i. Consequently we have 


Checking, we have 


5 iz?7—iz+iz—i 


1 (24+ 8-237 be 4 oe T= (— 7° 4-22 — 1 
fthQ)) = =| — - —— Se 
2\iz-—i zt+l 


2 
ieee us i esp 
—. =i” 


as desired. 


Answers to Chapter VII, section 4 
VII.4.1 We analyze this map by first noting that this map is equivalent to the map 


2 
_jz-l for z; > 0 
ne OA for z, <0 


where z, is the real part of z = z; + iz,. Consequently, for A = 0, and for any open set lying entirely in the right half plane, 
the mapping is open. For A € [—1, 0], a look at a small €-ball about the origin shows why the map is still continuous, namely, 
the action of the term Ax is to shift the points within the ball over towards the right. This produces a closed oval, with points 
to the interior mapping to the interior and the boundary mapping to the boundary. Consequently an interior point is still 
surrounded by an open ball of the image of interior points, and the map remains open. Specifically, given (x, y), if we have 
a point (x), y) to the left of (x, y), then since near the origin, x? <x, the Ax term dominates, and the disk is reversed, 
f (x1, y) is to the right of f(x, y), and points with the same x value map to the same x value, creating an oval around the 
center (x, y). Then an open set maps to an open set. 

The mapping is not open for A = 1 however. To see this, choose first €) < 1/2, so that for points in an €o-ball about the 
origin, 2x? < x. Then for points on the boundary of the left half ball x* + y? = and the intersection with the y-axis, 
the portion on the axis maps to an interval on the real axis [—ej — 1, —1]. The rest of the boundary meets this interval at 
—e? — 1, but lies entirely to the left of it: 


r+y=e 

f(x,y) = @* — y’ — 14 x, 2xy) 
=x? — (ef — x”) —1+x, 2xy) 
= (—e? — 1) + 2x? +x, 2xy) 


Since we have chosen €o such that the x term dominates the 2x? term this indeed lies to the left. If we look at this figure 
from € = €p as € —> 0, then the envelope of the images of the boundary form the boundary of the image of the ball B.,(Q). 
In particular, there are points interior to this ball which map to the boundary of the image. For any such point, a smaller ball 
around that point folds interior to this envelope, exposing it, hence there is no open set around such a point which maps to 
an open set, and the map is not open. 

Finally, by choosing a ball around the origin sufficiently large, for any fixed A, the x’ and y” terms will dominate the Ax 
term so that f(Br(O) > Br(O). In fact, as (x, y) — oo, the map approaches the analytic map f(z) = z—1. 
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Answers to Chapter VIII, section | 


VIII.1.3 We must first show that we satisfy the Lipshitz continuity requirement of Theorem 11.1 in Chapter III, namely for 
each w; there is a k;, independent of p, x, such that 


d(wiy(X), Wig(x)) Ski dp(p, q) 
For w)(x) = A,x, we have 
A(Aj xX, AigX) = lAtyX — Aigx| = Ix||A1p — Aig] = lxldp(p, 9) 
which is not independent of x. Similarly, for w2(x) = A2x + 1 — A2, we have 
A(A2,x + 1 —Ag,, Avge +1 — Arg) = JAa,x +1 —Az, — Aagx — 1 + Adg| = |x — 1ldp(p, gq) 
also not independent of x. We must restrict the domain as in exercise VIII.1.2. We have —.9 < A, A2 < .9, and we look for 
an interval [a, b] such that W({a, b]) C [a, b] for all p € P. 
For A,, A2 > 0, [0, 1] contains all the attractors. To see this, we have 
w;([0, 1}) = [0,1] we(f0, 1]) =[A2, 1] w3((0, 1]) =[1/2, 1] 


When A, < 0, we have w,({0, 1]) = [A;, 0] ¢ [0, 1], and likewise w2([0, 1]) =[1, 1 — A2] ¢ [0, 1]. One pushes the value for 
the necessary lower bound for the interval down, the other pushes the upper bound up. These values reach their minimum 
and maximum when A = Az = —.9. Consequently, we will have found the proper interval [a, b] when 


W ~9,-.9)([a, b]) C [a, 5}. 
Writing this out explicitly, we have 
wi([a, b)) = [—.9b, —.9a] 


w2([a, bj) = [—.9b + 1.9, —.9a + 1.9] 
w3([a, bj) = [.Sa + .5, .5b + .5] 


which means that a, b must satisfy 


a < min(—.9b, —.9b + 1.9, Sa + .5) 
b > max(—.9a, —.9a + 1.9, .55 + .5) 


Now in order to include the interval [0, 1] needed by the positive values for A;, A2, a < 0, and b > 1, which means that 


min(—.9b, —.9b + 1.9, 5a + .5) = —.9b 

max(—.9a, —.9a + 1.9, 5b + .5) =—.9a + 1.9 
Setting the inequalities above to equality to solve we have 

a=-9b b=-.9a+1.9 b=.18b4+1.9 
which yields 
b= 3 22.3 av-—2.1 
so that, for convenience, the interval [—3, 3] contains all the attractors, and we may prove the continuity on the IFS 
{f—3, 3h Ww}, W2, w3} 


to get it for all of R. Looking back to our original x dependencies we now see that by substituting the endpoints of this 
interval for x in the distance expressions, we have the following Lipshitz conditions: 


d(wipx), Wi, (x)) <3 -d(p, q) 
d(w2,(x), w2,(x)) <4-d(p, q) 


Consequently, the attractor varies continuously with A € P. 
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Answers to Chapter VIII, section 2 


VIII.2.2 We first prove the hint. Let N be the set of all points B € #((X) such that B is disconnected. Suppose B € N. Then 
by definition, there are sets B,, Bz C B, such that 


B, U B, = B; B, NB. = 9. 

Furthermore, since these sets are closed in B, they are compact. Then if 5 = h(B,, Bz), 6 > 0. Suppose B’ € H(X) such that 
h(B, B’) < 5/3. Then for any x € B’, we have d(x, B) < 5/3, and for any y € B, we have d(y, B’) < 5/3. Let 

B, = {b € B’: d(b, By) < 5/3} 

B, = {b € B’: d(b, Bz) < 8/3} 
Since each point in B’ must be close to a point in B, we must have 

BU = BL 

Now suppose that x € B; N B, # @. Then 

d(x, B,) + d(x, B2) < = <8 


and by the triangle inequality, we would have, for some b; € B, and b2 € Bo, 
d(b,, bz) < d(x, b;) + d(x, b2) < 4 
which is a contradiction. Hence 
BLN B, = 
and B’ is also disconnected. But B’ was any set within « = 5/3 of B, so there is an open ball B.(B) C N, and N is open in 
- let p be a parameter value such that A(p) € N, that is, A(p) is disconnected. Then there is an € > 0 such that 
B.(A(p)) C N. This ball is open, and by the assumptions of the problem, the map A: P — H(X) is continuous, so 


A~'(B,(A(p))) is an open set in P containing p such that every element corresponds to a disconnected attractor. Hence 
the set of points in P for which A(p) is disconnected is open in P. The complement is M, which must therefore be closed. 


VIII.2.4 For any value of 4 in the parameter space, the attractor A can be written as the set of points of the form 


is y* +k 
k=0 


Since if z € A, then the point 


co 


-z7Z= be os 


k=0 
is also in A, we have , 
A=—-A={z:—-zeEA} 
(in other words the attractor of an element of this family of IFS is itself radially symmetric). Defining 
AA = {Az:z€ A} 
to make the notation simple, we have 
A=(AA+1)U(AA— 1). 
And 


-AA+1=A(—A)+1=AA41 
iA =-1=)(—-A)H 1 = 1A 1 
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in other words, A(A) = A(—A). It then follows that if the attractor of A is connected, so is the attractor of —A since they are 
identical, so M is symmetric about the origin. | 


VIII.2.5 We have the IFS 
{C;Az+1,aAz—1} 
where A = (0, 1/ /2). We may rewrite this as the IFS 


{R?: w(x, y), w(x, y)} 


(3) =(:e2 07) G)+()=”2(2)+(0) 
m(3)=(a5 0 )G)-(0)="2(2) -(0) 


Suppose we look at the action of these two maps on a rectangle centered at the origin. Each map flips the rectangle along 
the corners, reduces each side by 1/-/2, and then translates it either one unit to the left or to the right. We may solve for the 
position of a corner of such a rectangle, symmetric about the origin, such that it went to half the original rectangle. To do 
this, we must have (in absolute value) 


where 


x=1/V2(y) +1 
y =1/V2(x) 


which has the solution x = 2, and y = V2. 

Consequently, if we have the rectangle with corners (+2, +./2), Then each map takes this rectangle to one of the 
horizontal halves of the original rectangle, just-touching along x = 0. The attractor is thus the rectangle, which tiles the 
plane. 


VIII.2.7 Suppose that w,(A) M w2(A) = {Zo}, a single point. We wish to argue that if / is a path which intersects itself, 
then J is a point. Certainly such a path which is not trivial must be contained in either w;(A) or w2(A) or, since it must pass 
through 2p this is a point of intersection. Then we may argue the same about the points w) (Zo) and w2(Zo), so these are points 
of self intersection. By induction, such a non-trivial non-self-intersecting path can not exist. 


VIII.2.8 Let e = e,e2---e,--- be a point in © a code space on the symbols {—1, 1}. For A € C, define 
fA) =Hertertea>+--- 


We want the radius of convergence of f(A). f(A) is bounded above by 1+A+ MW+e0= )_ A” (corresponding to e = 
11). It is bounded below by —1 — A —A2--- = — )-A", corresponding to e = —1—1. These two series have a radius of 
convergence of |A| < 1 and diverge at the endpoints. Consequently f(A) has a radius of convergence of 1. We show that the 
attractor for each A is the set of points 


+1 +024 A7H... 


and that the map f(A)(e) = o(e). The attractor is the limit of the sequence in H(C) given by {W”({0})}. This sequence 
yields 


Or +1 tAt 1H 4tVtALIN-:- 
By induction, A(A) is the set of points 
ti eA S22: 
which is the image of & under both f(A) and ¢: & — A(A). 
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Answers to Chapter Vill, section 3 


VIII.3.3 The attractive orbit here is not a cycle, but rather aperiodic. There are other bubbles with periods equal to powers of 
2 multiplied by the various odd numbers out past this point however. Try experimenting with the numbers around A = 1.75 
for instance. 


Answers to Chapter Vill, section 4 
All exercises in this section are programming experiments. 


Answers to Chapter IX, section 1 
IX.1.3 p, - p3 = 0.03 since the probabilities are independent. 


IX.1.4 The attractor is a Sierpinski triangle with vertices at 0, i, 1. The ball By is located at 2 +i and has radius ./2 and 
therefore intersects the attractor only at the point 1. The probability of getting only one string in code space is 0. 


IX.1.5 pl = .275, p2 = .125, and p3 = .5. We want to know about the ball B,. B, includes only w)(A), hence one expects 
a probability of .275. 


IX.1.6 There is a set of random strings (of measure 0 in code space) for which this number will not occur, e.g. Bz in the 
above problem with the string 11 has incidence 0. 


Answers to Chapter IX, section 2 


IX.2.2 X is the set of leaves, which is finite. ¥ = 2* the collection of all subsets of X. G is the collection of sets consisting 
of one leaf. Any set in F contains a finite number of leaves, therefore it is the finite union of sets in G so by definition, G 
generates F. | 

IX.2.3 In order to prove that S is not in the field # we must show that it can not be written as a finite union of sets in F. 
Suppose that $ = c= A, for some collection of A, € F where each A,, is the union of finitely many subintervals. Then 
at least one of the A, must contain the disjoint union of an infinite number of intervals. But this contradicts the hypothesis 
about A,. Hence S ¢ F. 

IX.2.4 Let X = [0, 1] x [0, 1] C R?, and G be the set of rectangles of the form [p/q, r/s] x [t/u, v/w] that is subrectangles 
with rational corners. Let F denote the field generated by G. Let S € F. We need to show that S has rational area. i) )"_, An 
has rational area, where the A, are subrectangles since the area of the union is given by Ee area(A,,) — oe j area(A; 
A;). ii) Similarly for any set of A, as above, the complement X \ . Ay= ‘ge X \ A, has rational area, because X \ A, 
can be written as the finite union of open subrectangles. Consequently any set S € F has rational area. The area of B(Q, 1) 
is 2 which is irrational, consequently B(Q, 1) is not in the field. 


IX.2.5 Call the sets in the figure 


a G, G2 b 
Se shee ease f 
Be Sl ee: 
Wee. eee ae 


Then we have the following list of equivalences, where those generated earlier in the list are used to generate the rest (once 
they have been established to be in the field). 
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a=X\G, 
X=G,Ua 
Ge X\ Xx 
c=G,\G2 
e=c\G, 
j=X\e 
i=G,\e 
facvi 
oe Bed 
m=iUl 
h=m\e 
o=X\h 
g=G,\o 
b=X\g 


IX.2.6 Let e be the image of the binary number a = .a,a2--- =) i under the map z which takes a; + 1 = e;, in other 
words (a) = e. The complement of e = (a), a irrational, in X is not the union of a finite number of cylinder sets, which 
under the inverse of this map map to intervals of rational width around rational points in [0, 1]. 

IX.2.7 X is a space, let F = 2*, the power set of X. Let A € F. Then since X \ A CX, we have X \ Ae F. Since 
X,@c X,@ €F. Compliments of subsets of X, and finite unions of subsets of X are subsets of X and are therefore elements 
of ¥. Hence F is a field (in fact the biggest field over X). 

IX.2.8 B is the o-field generated by rectangles with rational corners on the unit square. These rectangles form a countable 
base in the metric topology induced by the Manhattan metric, as evidenced by the fact that the o-field contains all balls of 
radius € for all € in the statement, and by the equivalence of metrics. To show this is the Borel field, the proof of Theorem 2.3 
shows that the o-field generated by a countable base for the topology of X is the Borel field. 


IX.2.9 5 is the codespace on the symbols {0, 1}. The cylinder sets are defined as all the sets, for all n, of the form 
C(e--- én) = {Co € B 2g, = C1, -- +5 Con =.¢,,} 


Let e be a point in ©. Then the cylinder set C,(e) is precisely the set of elements x € x such that x; =e; for i <n. 
Consequently, 


~ le; — x;| — le; — xi 
d(e,x)= = ———. < 1/3” 
2 3! = hs / 


By choosing é such that é; = e; fori < n and é; # e; for i > n, we have that C,(e) = B ae (e). The balls of radius 3~ form a 
countable base for the metric topology on &, hence, by the proof of theorem 3, they generate the Borel field B on Z. 


IX.2.10 Let A C R? be the Sierpinski triangle, and G be the set of connected components of R? \ A. Notice that since by 
definition of component, this generating set is disjoint. Consequently, one cannot get a subset of a component (other than 
the component itself or the empty set) by union or compliment of the elements in G. If F is the o-field generated by G then 
F likewise contains no nonempty proper subsets of any component of R? \ A. Let € > 0 be chosen so small that if x is the 
center of the largest triangle inside the Sierpinski set, B,(x) fits entirely inside this component. Then B,(x) is not in ¥, but 
it is in B. Hence F 4 B. F C B however, since any half plane is in B and any triangle is the union of the compliments of 3 
half-planes. 

IX.2.11 Let X be a metric space, G a generating set for the field F,, and for the o-field F. Let F3 be the o-field generated 
by F;. Then 
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Fy =F, U{|_J An: An € Fi) 


n=1 


We must show that this is the smallest o-field containing G. Suppose not. Then there is an A € ¥; such that A ¢ F2. But F, 
is the smallest o-field containing G so the extra entry must be of the form 


UJ An; {An} C Fi. 


n=1 
But the smallest o-field containing G must contain all of these sets, since it contains F;. Hence Fz = F3. 


IX.2.12 Let F be any field of subsets of X. Choose A € ¥. Then X \ A € F and the union of these two is in #. The union 
isAUX\A=X. 


Answers to Chapter IX, section 3 


IX.3.3 Let X =[0, 1] C R F is generated by subintervals, and for a < b we define u((a, b)) = u([a, b]) = b — a, and 
(A) = sum of lengths of subintervals of A. : 

(i) w: F — [0, oo). By definition (A) > 0. Since the sum of lengths of disjoint intervals is less than or equal to 
2({0, 1]) = 1 it follows that (A) < oo. 

(ii) Let [a, b] be an interval, and let b = lim,_...a, for some sequence {a,} such that a, € [a, b]. Then the sum of the 
lengths of intervals in the union 


[a, a,] U [a;, ag] U--- =[a, d] 


w([a, a1) + ) Gna — Gn) 
n=1 


This is a telescoping sum and is therefore equal to 
M([a, a1] + ([a1, limon] = [a, 5] 


Thus since any element in the field which can be written as the union of countably many elements must be unions of this 
form together with finitely many other elements, jz is countably additive. 

(iii) Let S be the set of rationals in [0, 1]. Then S is not in the field F. To see this, we note that neither S nor its compliment 
can be written as the union of finitely many intervals, since both contain countably many components, thus any finite union 
contains an element with countably many components. 

(iv) Let F be the o-field generated by ¥. For any field or o-field, since 


ANB=X\(X\AUX\B) 


the finite intersection of elements of a field are also in the field. Thus for any x = p/g where p,q are integers, the set 
{x} = [a, x] N [x, b] € F and we have jz({x}) = 0. The rationals (and therefore S) are a countable set, consequently we can 
write S = |_J"°, A, where {A,} are the elements of S. Consequently, S ¢ 7. We have 


ju(S) = A(_J An =) A(4,) = D> Ax} =0 
n=1 n=1 x=p/q 
IX.3.4 Let & be the code space on symbols {0, 1}, and define 0 < p,; < 1, pp = 1 — py. We define yu on cylinder sets by 
u(C(e, a en)) — Pe, +0 Dey 


To define jz on other sets in the field F there is really only one way to be consistent with this definition, and satisfy countable 
additivity. We describe this: 


u(X) = u(C(1) UC) =pit+ r=! 
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L(A U B) = pw(A) + u(B) — WAN B) 


where A / B is in the field, since it is equal to X \ ((X \ A) U (X \ B)) by de Morgan’s rules. The easiest way to see how 
this defines a countably additive measure on the field F is to note that for any finite set of codes {e;, ...e,} the measure 
defined on the cylinder sets of these codes taken in any order is the same. Notice also that two cylinder sets are disjoint if 
they disagree on any specified symbol. Taken together, the measure of a union of these sets is the product of the codes on 
which they agree, if they are the same length, and this product times the product of the ‘tail’ of the longer string if they are 
different lengths. There are no elements of the field 7 which can be formed which specify values for an infinite number of 
places, since this is the disjoint union of an infinite number of components. Consequently, 5 = {x : x is odd} is not in F, and 
({x : x7 = 1}) = p;. The set S € F since it can be written as a countable intersection (and therefore a countable union) of 
finite unions of cylinder sets. Its measure, as above, is the product of the probabilities on which the elements agree, namely 
all odd places, and is therefore lim,_... pj = 0. Notice that since the cylinder sets form a countable basis for the metric 


topology on Z, F is the Borel field on =. 
IX.3.5 Since multiplication is commutative, it is easier to define the indices in reverse order. Thus we want the intervals 


Wen O*** 0 We, ([0, 1]) 


as a generator for the field 7. Notice that the interval w,([{0, 1]) has measure p, and the interval w2([0, 1]) has measure p>. 
Define 1(X) = 1. Then we must have 


[L(W1 O We, O-+- 0 We, ([0, 1]) U w2 0 We, O- ++ O We) = PiPen*** Pey + P2Pen*** Pey 


in order to have even finite additivity, but 


PiPen*** Pe, + P2Pen*** Pey = (Pi +1 — Pi) Pen-+* Pe; = Pen’ ** Pen 


so that, since these are the subintervals of w,, o - -- 0 we, ([0, 1]) the ‘middle third’ has measure 0. Hence we have u(B) = 0 
for any B € F such that B Cc X \ A. This implies countable additivity, since every element of F must be the union of 
closed or open intervals beginning or ending on the boundary of a subinterval of the above form, and each subinterval 
is the countable union of subintervals, and satisfies countable additivity. Hence u is a measure on F. This provides 
(A) = 1, w(X \ A) = 9, w([1/3, 2/3]) = 0. 

IX.3.6 We define F on [0, 1] by using the IFS 


1 1 1 
{(0, 1], w,; = a* ag 4* + 5? Pi; P2} 

as in the previous problem. Here the subintervals touch, but the definition from problem 4 of u(X) = 1 yields that the 
intersection of two intervals at their boundaries has measure zero. In this case the field is the set of finite unions of 
subintervals which have boundaries of the form k/2”. By expressing the numbers on the interval in binary, since any element 
of the interval is the countable sum tee (e, — 1)/2” for e; € {1, 2} we generate all the subintervals of the interval, hence 
the extension of F is B([0, 1]). In order to generate the measure which takes the measure of an interval as the length of an 
interval, it is necessary and sufficient to have translation independence. This requires p, = 1/2. 

IX.3.7 Let (X, d) be a compact metric space, a Borel measure on X such that ~(X) #0. Let S be the support of on X. 
Then “(S) = “(X) £0, so that S 4 @. S is closed which implies that S is compact. Hence S € H(X). 

IX.3.8 Let «z be > a measure on 0 -field F, and let F be the class of sets of the form A U B where A € F and BCC € F such 
that u(C) = 0. F is a o-field since clearly X \ B = X \ C UC \ B and C \ B is a subset of an element of F of measure 


zero. The same implies with unions of this form. Define @(A U B) = 7((A) = (A). Then if we have a countable union of 
sets of this form, we have 


(\_J(An U Bn)) = BC{) An) U (LJ Bnd) 
n=] n=1 


n=1 
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which is the union of an element of F with a subset of an element of F of measure zero. Hence 


Ti(\_J(An U By)) = (J An) 


n=1 n=l 


and this latter obeys countable additivity by assumption. Hence F is ac-field, and 7 is a measure on it. 


Answers to Chapter IX, section 4 


IX.4.1 Let f : [0, 1] ~ R be a piecewise constant function with finitely many discontinuities, located at {x,,...x, C R}. 
Let xo = 0, X,4; = 1. Then f has a single value on each of the intervals Jp = [xo, *1], .. - Jn = [%n, Xn4i], and we let y; = 
f(x), x € [x;, Xi41]. Then if x; = Xfx;,5;4.1 is the characteristic function on the interval J;, we may rewrite 


f(x)=)- yx; 
i=0 
If is the Borel measure on [0, 1] given by ([a, b]) = b — a, we have, by definition 


| fdu=) > ym) = 5? yi (Xi41 — Xi): 
n=0 


[0,1 n=0 


For the Riemann integral of f on [0, 1] we have 


1 x Xn 
] f@jdt = / “FGsae do / " f@ddx 
0 x0 Xn 


x} Xn+1 
7 rade +--+ | Yndx 
x 


0 an 


= yo(xX; — Xo) +--+ Yn(Xn41 — Xn) 


= fdp 
(0, 1] 

[X.4.2 We want a measure on (@,Euclidean), defined on the field G = {A: A is a rectangle }, which corresponds to area. 
This means (assuming a < b,c < d),if A = [a, b] x [c, d] or the equivalent expression using open or half open intervals, 
we define (A) = (b — a)(d — c), and for unions of rectangles define ~(A U B) = w(A) + u(B) — w(AN B). Let Ae F 
such that A= |), An, An € F with the A, disjoint. Then for the union to be in ¥ we must be able to express it as a 
finite union of rectangles or the compliments of rectangles. Notice that the compliment of a finite union of rectangles is 
expressible on X as a finite union of rectangles. The measure of this region is then finitely additive by hypothesis, therefore 
pt is countably additive. Suppose there were another measure yz’ on F which yielded the area of rectangles in X. Then since 
both measures need to be countably additive, and since they agree on each subrectangle of X, the two measures agree on F. 
Hence py’ = w. The extension of yu to B is also a measure which yields the area of sets A € B, because B contains all open 
sets in X, the infimum represented in the Caratheodory extension theorem can be replaced by an infimum on open sets, and 
is therefore the area of the closure of A. 

To show that A is in B it suffices to note that it is closed. This follows since the compliment is then open, and can be 
covered by open sets in F. This cover has a finite subcover, which implies that X \ A is a countable union of sets in F, 
hence A € B. 


IX.4.3 The integral is defined on the intersection of two strips with A. To calculate it, we first determine the measure of 
an arbitrary strip [a, b] C [0, 1] intersected with A. The measure is countably additive, so note that given A C A, with the 
measure of A being (A), that we have 


(A) = 4(w;(A) U w2(A)) + 4(w3(A)) = (pi + p2)M(A) + p3H(A) = Su(A) + Su(A) 
Suppose that for n < N w([k/2”, (k + 1)/2"] x RN A) = 1/2" for 0 <k < 2". That is, the measure of A falling within a 
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strip of length 1/2” positioned on a boundary of [0, 1] subdivided into equal 1/2” intervals is its length. We note that each 
such strip intersected with A is a vertical stack of disjoint Sierpinski triangles. We now subdivide the strip into two parts. 
The left-hand parts contains a stack of twice as many triangles, half the size, while the right-hand is a stack of as many 
triangles, half the size. Each triangle in the original strip is produced by transforming 4 by some sequence (w; 0 - - - Wx) 
of transformations, those on the stack in the left-hand portion are either of the form w; 0 --- wy, 0 w; OF Wj O-:: 0 WO W2 
while those on the right have an included transformation by w3. Consequently the measure of the 1/2” width strip is divided 
equally on its two halves. By induction, and finite additivity, the measure of any strip beginning and ending on boundaries 
of the form k/2" is its length. By countable additivity, the measure concentrated on any strip [a, b] x RN & is its length. 
Having demonstrated this, the integral is 1 - 2/3 + (—1) - 1/3 = 1/3. We will use the above result in further exercises. 


IX.4.4 Let a, B € R, and f, g simple functions. Then we have 
af =a) yixz,(x) = > (@ayi)x,(2) 
Bg =B > uxs,(x) =) (Ba)xs,@) 


are simple functions. By definition, af + Bg is af where g = 0, and Bg where f = 0 and the sum elsewhere. We need to 
write the collection of sets (_) J; UU J; as the union of disjoint sets. This is done by writing 


Ky = Ik \ (J Jj, Kmaj =Ji\ dis Kmanteity = 19 Jj, Ke =O elsewhere. 


Define w; = ay;, w; = Bzj, we = ay; + Bz; on the sets defined above. Then w(x) = >> ux Xx, 18 a simple function, and 
equal to af + Bg. The integral of the sum is the sum of the integrals by the above definition of w(x). 

IX.4.6 We need to rewrite the function, as in the above problem, as a simple function with disjoint sets. Writing C = 
A\ B,D=B\A,E=ANB, we have 


ff) = xc + (2.3)xn + 1 + 2.3)XE 


To evaluate the integral, we need the measures on each of these sets. We define them in terms of strings: C is the set of 
strings beginning with 1, and not having a 2 in the second place, i.e, C = C(1, 1) the cylinder set beginning in two 1’s. D 
is the set of strings not beginning with 1 which have a 2 in the second place, or, D = C(2, 2). E is the set beginning with 1 
with a 2 in the second place, or, E = C(1, 2). By the definition of 4 we have u(C) = pipi = (.4)*, u(D) = p2p2 = (.6)’, 
L(E) = pi p2 = (.4)(.6). The integral is given by 


| fdu = (4)? + (2.3)(.6)? + (1 + 2.3)(.4)(.6) = 1.78 
x 


IX.4.7 Define the Borel measure 5, by u(B) = 1 if a € B and 0 otherwise. Let f be continuous f : (X,d) — R. For each 
n, denote by {B; ys a Borel partition of X into disjoint sets B; where diam(B; < 1/n. Define { f,,} to be a sequence of simple 
functions defined on each partition by f, = >_ y; Xe, with y; = f(x;) on the sets which do noi include a and y;, = a on the 
set in each partition a € B;,. Then since the measure of all the sets is zero with the exception of B;, on which it is 1, we have 


| fnd5o - f(a)Vn. 
X 


Hence the limit as n — oo is f(a). Our partition is independent of the choice of x; with the exception of the choice of y,,, 
but the function is continuous, so that as n — oo another choice of x;, approaches a as the diameter of B;, goes to zero. 
IX.4.8 We proceed by first deducing that since f(x, y) is continuous, and the measure on @ is defined on rectangles by their 
area, the value of the integral is the Riemann integral of f(x, y). By definition, the Riemann integral is done by taking a 
sequence of rectangular partitions of ™, and constructing two sequences of simple functions, choosing y;, y; by 


y; = inf { f(x, y): (x, y) € B;}; y; = sup{f (x, y): (, y) € Bi} 


By Theorem 4.1, since f(x, y) is continuous, the two sequences converge to the same limit, namely = fd, since they 
converge and converge to the same limit, the Riemann integral is defined and is equal to this limit. The value of the integral, 
by taking the Riemann integral, is 1/2. 
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IX.4.10 Refering to exercise 3.4, the measure of any strip intersecting A is simply it’s width. The function f(x, y) = x? 
is independent of y so we partition the space into strips, their intersection with & produces decreasing diameters. f(x) is 
continuous, hence this partition and the choice of x; are immaterial. Choose x; to be the left-hand boundary of each strip, 
intersecting some point on A. Then the value of the integral is just x*dx = 1/3. Let X be the set of pixels on a computer 
display. For simplicity, define the distance between the pixel (i, 7) on the i th row and in the j‘" column and the pixel (k, 2) 
as |i — k| + |j — £|, which is a Manhattan metric. Then X may be related to the subset of R? consisting of integer coordinate 
points in the first quadrant of the same size. This set is closed and totally bounded. Hence X is compact. Since there are 
finitely many pixels, we may define the generator for the Borel subsets to be all the subsets consisting of one pixel each. 
This defines the Borel field on X to be the power set of X. Thus any set of pixels is a Borel subset. Suppose there are J rows 
and K columns so that the total number of pixels is N = J x K. Define a Borel measure by (A) = (number of pixels in 
A)/N. Any function from X — R is continuous because the pixels themselves are open sets, hence the inverse image of an 
open set in R is always open. 

We will choose the function f(i, 7) =i + j. Any function on the finite set of pixels can be described by its value on 
each pixel, hence its value at a pixel times the characteristic function at that pixel, hence the sum of such terms makes any 
function simple. Then the integral of f over X is 

bt] 2 ae 
| fan a. wy e+) 


where the summation notation indicates summing over all pixels in X. Since there are K pixels with first value i, and J 
pixels with second value j, we have 


1 wes ee a= 

wait Dea (Ft 3 ”) 
1 (ja 1)) : eee) 

N 2 2 

a tee 217 + NK + }) 

N 


so the integral is 


Answers to Chapter IX, section 5 
IX.5.1 Let 
Ket, pripp ef), 2; 3,6 5 KY} 
and define a metric d : X — R by 
d((i;, ji), (da, Jo)) = la — tal + 1A — Jal 

Then (X, d) is a compact metric space. Define yz to be an atomic measure with 

ae 

MG, J) = 3 2) 
and let v be another atomic measure with 
ae 1 
v(@, J) = Xi 


for every (i, j) € X. 
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We wish to calculate the distance in the Hutchinson metric between jz and v, given by 


dy(u, y=suplf fan —f fav: feces, R), |f(x~) — fO)| < d@, y)Vx, y € X} 


where C(X, R) is the set of continuous functions from X to R. Now X is a discrete space under a metric which is the 
restriction of the Manhattan metric on R*, consequently f : X > R is always continuous. As well, the integrals are the 
integrals of simple functions. Hence for every f, we the integrals can be expressed 


i+ J 
| fau= 2 ft Dey RE 


2 fav = eK 


so the difference is, for any function f, 


i+] 1 
FG, D( gt - u)"% K? rep eke n( 4-1) 


Intuitively, one is tempted to pick the function f (i, 7) =i + j, which among the set of valid functions is the largest of the 
form 


fi, j) =ai + Bj 


However, the range of the term (i + j)/(K + 1) — 1 is from 2/(K + 1) —-1=—(K — 1)/(K + 1) to2K/(K +1)-—1= 
(K — 1)/(K + 1) which gives a negative coefficient to f(i, j) over almost half of X detracting from the value of the sum. 
We must therefore endeavor to pick f(i, j) as large as we can in absolute value, negative when the measure subtraction term 
is negative, and positive when it is positive. That is, we maximize by choosing the largest value for | f| allowable under the 
distance constraint where |(i + j)/(K + 1) — 1] =|C(i, j)| is largest. The interested reader can confirm that this exceeds 
the intuitive result. The value of C(i, j) is constant along each diagonal from (1, ) to (n, 1) and is equal to zero along 
the major diagonal from (1, K) to (K, 1). It is anti-symmetric with respect to this diagonal, negative above it and positive 
below. 

We thus place the largest values for | f| at (1, 1) and (K, K), and since the distance between these is 2K — 2, we set them 
to +(K — 1). We then proceed outward from these points in Manhattan metric circles. The distance between such circles 
(which are the diagonals where they intersect X) is always the horizontal distance between the extensions of the diagonals, 
thus we end up satisfying the distance constraint by putting f to 


—(K-1) -(K-2) ». =1 

—(K-2) -(K-3) - 0 1 
= 0 es (K —3) (K —2) 
0 1 + (K-2) (K-1) 


This may be summed by taking twice the sum above the diagonal, and noticing that there are m entries where (K — 7) is the 
absolute value of the entry, yielding 
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9) K 
eo Se 
K*(K +1) d, 
2 K 
as KXK +1) + 1) > (nk? a In? K + n°) 


n=1 


2K7 << 4K oN, 2 a. 5 
- RK +H 24" 7 RK ED 2” is RK ED 2" 


2K?K(K+1) 4KK(K+1)2K+4+1)  2K*(K +1) 


~ 2K2(K +1) K2(K +1) 4K2(K +1) 


K—-1 
= oo ae v) 


IX.5.2 Since  € P(X) the measure of the entire space is ~(X) = 1. This value is identical with the value on the support 
of which we will call S, that is, this implies (S) = 1. In particular it is non-zero, and since, by Theorem 3.1 in Chapter 
IX statement (3) we have 4(@) = 0, S 4 @. Then by Theorem 3.4 in Chapter IX, the support of a Borel measure on a metric 
space is closed. Since X is compact, a closed subset of X is also compact, hence S is compact. Since S is compact and 
non-empty, 5, the support of wz is an element of 7((X). 


Answers to Chapter IX, section 6 


IX.6.1 We want to show that the Markov operator is indeed a transformation on P(X), that is, M(v) : P(X) > P(X). It 
suffices to show that 


vo w;' : P(X) > P(X) 


since a sum of these operators is a Borel measure if they are, and }°; p; = 1 indicates that they are normalized if each 


entry is. 


(1) oo : : X — X, since for hyperbolic IFS these are invertible maps. In addition, if we define w;'({x}) = = @ whenever 
x ¢ w;(X) then w, '(X) = = X. Hence vo w; ~!(X) = 1, and this is a normalized Borel measure. Hence M(v)(X) = 1. 
(ii) vo wy i is amap X — [0, 00) (actually X — [0, 1]). Furthermore, by definition, if {A; se 2, are mutually disjoint, 


vow; Ua and leeds )) 


joi j=l 


=> v(w; '(A;)) = xe o wy ')(Aj) 


j=l 


so that v o w; | is countably additive. Since w;' maps open sets to open sets (because w; is a contraction, it is continu- 
ous),and since w;'(X (A= w;'(X yr 10 % “— the generators for the Borel subsets into themselves, and pre- 
serves the operation of set complementation. sence w; :B-— B, and the expression v o w;' is well defined. Hence we 
have v o w;!: P(X) > P(X) and therefore M(v) : P(X) > P(X), as desired. 

IX.6.2 We define L by 


L={f €C°(X,R):|f(@) — fO)| <dQ@, y)Vx, y € X} 


and let f € L. Define f=s"! >=; Pi fi © wi, and we wish to show that f €L. Since f is a scalar times the sum of 
compositions of continuous functions, f is clearly in C°(X, R). We must therefore only verify the distance condition. For 
all x, y € X we have 
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IF@) — FO = Is" DIF 0 wile) — fo wi) 
<5!) pilf owi(x) — fo wi(y)| 
<s7')> pilwi(x) — wi(y)| since f € L 
<s7'}° pislx — y| since s = max s; 
<s'-slx—yl- >) pi 
< |x — yl|=d(x, y) since ) | p= 1 


Hence f € L. 
IX.6.7 Let X consist of three points: X = {A, B, C}, and let an IFS be defined by 


{X; w1, W2; pi = .6, po = .4} 


where 
w(A)=B w(B)=B wi (C)=B 
w(A)=C w(B)=A w(C)=C 


Furthermore, let 49(A) = 4o(B) = o(C) = 1/3. We now let jz, = M”(o) and find constant entries for the Markov transi- 


tion matrix M,a,b,...,i, such that 
[Ly (A) a-b=c [n—1(A) 
(ic = (< € r) (1) 
Un(C) g chk Un—-1(C) 


M(u)=)_ pivow;! =.6(uow;')+-4(now;') 


By definition, 


so that 
Ln = 6(Un-1 o w;') — A(Un-1 o) w5') 
and we have 
[n(A) = .6(n-1 0 W;, '(A)) + .4(Un-1 0 Wy '(A)) =0 + 4un-1(B) 
Mn(B) = .6(tn-1(A) + Mn—-1(B) + Un-1(C)) +0 
Ln(C) =0 + 4(Mn-1(A) + Mn-1(C)) 
giving us 


2g APO =A 2D 
=(5 6 6] 
O 4A 


For a discrete space like this, M completely determines M : P(X) > P(X), by defining 2, at each point in the space. In 
the limit, M” —> M, an operator which takes every point in P(X) to some measure yr, and which is therefore necessarily 
singular. By definition wu; = M(u rf). This is an eigenvalue equation which means that 4, is an eigenvector of the eigen- 
value 1. 
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IX.6.8 We first look at the sequence of measures {{,}?29. We have 


pi(A) = > | pimo(w; '(A)) 
p2(A) = D7 pitr(wj'(A)) 
= Di pi) | pitow; (wz '(A))) 
i j 
= D7 Pid Pitto(w;' ow; '(A))) 
i j 


Since the sum after n such iterations is finite, the order is not important and we can write 


Hn(A) = )_ pipj--- Prblo(wy' o--- 0 w7! ow; (A) 
Fash 


where the sum indicates a sum over all the indices i, j,...,k. 

[io is concentrated on A since, by assumption, fo € P(X), and fo(A) = | so the support of jz is a subset of A. If x ¢ A, 
all the terms in the above sum are zero because of this, hence the support of jz, is again a subset of A, and we must have 
tn(A) = 1, because M : P(X) > P(X). Now let yu be the unique invariant measure of the operator M, that is M(u) = wy. 
From the formula just derived with respect to (49 for M” (149), and from the invariance of 2 we must have, for any set B, 

u(B)= >~ pipj--: Pre(wz!o--- ow; ow; (B)) 
bop a5k 
Now choose B = w; 0 w; 0-+: 0 w,x(A). Then we have 


[L(w; ow; 0+:-0 U,(A)) = . PmPn-+: p(we' ow! o-++w,'(w; ow; 0-+- 0 w,(A)) 


SS 


where the sum is over the same number of indices as in the composition for B. Since each element of the sum is positive or 
zero, the sum is greater than or equal to each of its entries, and we have 


p(w; ow; 0o-+-0 w,(A)) 
> pipj-+* Px(W,' 0-++ 0 Ww! ow; (wo Wj 0 +++ 0 WR(A)) 
> Pipj:: Pe(A) = Pi Pj ++ Pr 
Equality holds precisely when this is the only contributing term of the sum, that is 


1g... 0 wr (wi ow; o-+- 0 w,(A)) =0 


uw! Ow, 
whenever the ordered set of indices (m,n,...,1) # (i, j,..., k), which says for all of these ordered sets the set 
we'ow;!o-+-ow, (wo Wj 0-+-0 w,(A)) 
is at least of measure zero, if not empty. It is empty for the totally disconnected case, hence equality holds when the IFS is 
totally disconnected. 
1X.6.11 The measure jz has support on A, consequently it suffices to restrict to Borel subsets of A. We first let the IFS be 
totally disconnected. Then the sets of the form 


W; = Won Oo Wo, | O==*420 Wg, (A) 


for all n form a countable basis on the space A. Hence since for the totally disconnected case, these are all both closed and 
open, for any Borel subset B of A we have 
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for some disjoint choice of W;. But 
ie) le, @) oo 
¢'(B=¢o '(Jw=Ve'm=Uc 
i=! i=l i=l 


where C; are disjoint cylinder sets in &. The measure p on & by observation gives the measure on any cylinder set as 


p(C(o1,..., O%)) = Pi--+ De 


which is precisely the measure jz applied to the corresponding basis set W; on A. Consequently, the measures agree, and we 
have 


p(p"'(B)) = u(B) 


When the attractor is not totally disconnected, we have for each W; finitely many cylinder sets in the image @~'(W;), which 
are disjoint, and correspond to terms in the sum given in exercise [X.6.8. Consequently the measures agree on Borel subsets 
of A in this case as well. Since the Borel subsets of A are precisely the subsets A % B where B is a Borel subset of X, and 
since the support of the measure is A, the equality holds for any Borel subset of X. 


Answers to Chapter IX, section 7 
TX.7.1 We want to prove that ([x, x + 6]) = 6, where yp is the invariant measure of the IFS 


{(0, 1]; wy = 1/2x, wo = 1/2x + 1/2; 1/2, 1/2} 


To do this it suffices to show that this expression holds on any closed interval of the form [k/2”, (k + 1)/2”] C [0, 1], since 
we may then build an approximation of any interval from these, good to any error, by the disjoint union of subsets of this 
form which cover the interval. Since yz is defined as the infimum of the measures of such covers, we let the error go to zero. 

To show that w([k/2”, (k + 1)/2"]) = 1/2" we note that for each iteration of W(A) the image is 2 times the number of 
subintervals of A of half the length of those of the iteration before. Each of these subintervals is of the form 


Wo, O++* 0 Wo, (A) 
where there are k terms in the composition for W*‘(A). Consequently, these intervals of length 1/2 have measure 
[L(We1 O-+- 0 Wox(A) = Po, *** Po, 


Since each term in the product is 1/2, the right hand side is 1/2‘ as desired. 


IX.7.2 We must prove the yz applied to any rectangle in @ gives the area of the rectangle, where yu is the invariant measure 
of the IFS 


{M; w), W2, W3, Wa; 1/4, 1/4, 1/4, 1/4} 


and the w; are given by 


& 


e 
Ww 
me MS eR wm MS & 
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€ 
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that the invariant measure yz is uniform, that is, that for any rectangle, we have (([a, 5], [c, d])) = (c — a)(d — b). We 
argue as in the previous problem: For any square with a side of length oa of the form 


RE Rt1 2-f T+1 
S= ta ha 
(5 on J La an 


we have some sequence i, j,..., k of digits, of length n such that this square is equal to 


) 


S = w; 0w;0---0 u;,(A) 
and hence its measure is 
i y 
Qn 
so that the measure has the uniform property on such squares. These squares form a countable base for the Borel subsets of 
A, consequently any rectangle may be written as 


HS) = pipj- ++ Pr = ( 


[o,@) 


({a, 6}, [c,d)) =|J Si 


i=] 


where the S; have been chosen such that (S; 1 S;) = 0, that is, they meet only on the edges. Consequently, by countable 
additivity, we have w(({a, 5], [c, d])) = (c — a)(d — 5), for this rectangle, consequently the measure of any rectangle in A 
is its area. 


IX.7.4 The distance between two points in ((0, 1], dp) can be written 
d,(a, b) = |d,(a, 0) — d,(b, 0)| 
We start with the sets in each space corresponding to addresses in a cylinder set in code space, namely 
@(C(0}, 02,...,On)) and @,(C (Oj, 02, ..-, On)). 


If the measures agree on the size of these sets, which form a base for the Borel field in each space then since we can rewrite 
f on these as 


f (C(04, 02, «+ Fn)) = Pp(G (C01, 62, - --On))) 


we will have demonstrated that the measures are equal. On [0, 1], almost by definition, the invariant measure v applied to a 
cylinder set yields 


v(C(o1, O2,- +65 On)) = Pip2::: Pn; 


that is, the probability of landing in the cylinder set on n iterations. 
On [0, 1],, the cylinder set corresponds to an interval with endpoints 


010)+++0,0 and .0;02...0,(N — 1) = 0102... (0, + 1)0 


How wide is this interval? Start with n = 1. The interval (.0;, .(0; + 1)) can be read from the IFS we used to construct 
[0, 1],, to be of width p,,. A contraction of the same relative subdivision of [0, 1] is placed inside it to make our “tick 
marks” by the IFS, so that the portion of this subinterval whicn is also in the interval (.0,02, .0;(02 + 1)) is thus Dg, Pop. 
If this proceeds in this fashion up to o;, then the subinterval which is again divided by the action of the IFS is of length 
Po, Por *** Po,» and we take a fraction of this equal to po, , , times its length. Consequently by induction, the length of the 
image of a cylinder set is 


U(bp(C (01, 02, «+ 5 On))) = d(.0102 +++ Fns 0102 *+* (Gn + 1) = Poy Poy *** Pon 
which agrees with the value on [0, 1] for the image of the same set under the invariant measure v. Hence v(A) = w(f (A)), 


and f is measure preserving. 


IX.7.5 The diagonal of the square forms right triangles with the vertical lines generated by the tick marks on the x-axis, and 
with the horizontal grid lines generated by tick marks on the y-axis. The slope of this line is 1, hence the distance between 
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horizontal grid lines on the diagonal is ./2 times the distance on the x-axis. Consequently, a point picked at random on the 
diagonal has a p; chance of falling between the i — 1" grid mark and the i" grid mark, and so forth. With probability 1 then, 
a point on the diagonal will have each digit occuring in proportion to the p; spacing on the x-axis in its x coordinate, and 
the digits occur with equal frequency in its y coordinate. 


Answers to Chapter IX, section 8 
There are no mathematical exercises in section 8. 


Index 


V , definition, 10 


ASS) 
ie 


SAL, definition, 9 
\*/ | definition, 9 
® . definition, 9 
A , definition, 9 
 , definition, 9 
lA|, definition, 58 
AC X, definition, 9 


address, 2, 115, 122, 125, 141, 149, 274, 300, 
359 
definition, 125 
illustration, 116, 117, 118, 119, 120, 121, 124 
periodic, see periodic address 
affine transformation, 2, 43, 45, 49, 51-55, 53, 
70-74, 77, 95, 97, 98, 99, 100, 113, 118, 
119, 130, 136, 142, 145, 211, 219, 225, 
227, 233, 236, 239, 243, 253, 272, 332, 
361, 376 
definition, 49 
illustration, 45, 48, 51, 52, 56, 57, 72, 98, 100, 
101, 102, 103, 104, 106, 107, 233, Plate 9-11 
see also transformation, affine 
algorithm, see fractals; specific algorithms 
for computing 
analytic transformation, 61-68, 74, 289 
definition, 66 
Andes Girl, 377 
illustration, Plate 27 
Arctic Wolf, illustration, Plate 28 
attractive cycle, 147, 268, 271, 314 
definition, 132 
attractive fixed point, 133, 136, 147, 247 
definition, 132 
illustration, 133-148 
attractive periodic point 
definition, 133 
attractor, 81, 83-84, 85, 87, 88, 90, 91, 92, 93, 
94, 95, 96, 97, 99, 100, 101, 103, 111, 113, 
115, 116, 117, 118-119, 122, 123, 125-131, 
133, 134, 136, 141, 145, 146, 149, 152, 153, 


154, 155, 156, 158, 159, 169, 170, 177, 182, 
183, 184, 185, 199, 200, 202, 211, 212, 214, 
215, 218, 223, 226, 232, 234, 243, 246, 252, 
253, 260, 266, 271, 272, 280, 282, 298, 299, 
301, 302, 305, 307, 308, 309, 313, 318, 319, 
327, 331, 332, 333, 343, 345, 347, 359, 363, 
367, 376, 382 

definition, 81 

illustration, 82, 92, 93, 101, 102, 103, 104, 
105, 116, 121, 187, 201, 231, 311 

strange, definition, 81 

A \B, definition, 10 


basin of attraction, 278, 281 
definition, 258 
illustration, 280, 281 
bialogical modelling, 3, 282 
Black Forest, 377-378 
illustration, Plates 31-34 
Bolzano—Weierstrass Theorem, 22 
illustration, 22 
proof, 20 
Borel field, 340, 341, 342, 343, 345, 347 
definition, 340 
Borel measure, 4, 341, 343, 344, 347, 348, 349, 
365, 372 
illustration, Plate 23 
Borel measure, normalized, 349, 350, 404 
definition, 349 
Borel subset, 335, 340, 341, 345, 363, 365, 366 
definition, 335, 340 
boundary, 24, 54, 259, 263, 266, 267, 278, 296, 
298, 301, 305, 310, 314, 317, 318, 324, 326 
definition, 23 
illustration, 18, 24, 25, 104, 120, 259 
bounded, 21, 171, 195, 198, 201, 310 
definition, 20 
totally, 20 
boundedness, 15 
Box Counting Theorem, 175-176, 188, 
190-191, 224 
illustration, 175, 178, 179 
proof, 175 
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2, definition, 43, 44 
ce definition, 7 
C, definition, 7 
Cantor set, 140, 155, 165, 180, 259, 279 
illustration, 44 
Carathéodory’s Extension Theorem, 4, 342 
proof, 342 
see also Extension Theorem 
Cartesian map, 24 
Cartesian product, 26, 123, 229 
definition, 9 
Castle fractal, IFS code for, 186 
illustration, 185 
Cauchy sequence, 16, 17, 19, 20, 35, 36, 38, 42, 
45, 77, 78, 92, 124, 171, 210, 257, 288, 348, 
definition, 15 
illustration, 18, 29, 35, 39 
Cauchy subsequence, 
extension of, 34, 35 
illustration, 35 
cell, 324 
Chaos Algorithm, illustration, 90 
Chaos Game, 2, 3 
for a recurrent iterated function system, 385-388 
chaotic, definition, 167 
chaotic dynamical system, 2, 164, 167-169, 
193, 275, 278, 285 
definition, 167 
characteristic function, definition, 344 
x, definition, 344 
classical Cantor set, see Cantor set, classical 
closed, 20, 21, 23, 24, 172, 288 
definition, 19 
closedness, 2, 15 
closure, 129 
definition, 19 
Cloud Study (video) 113, 377 
illustration, Plate 25 
code space, 78, 84, 119, 120, 121-123, 125, 
126, 127, 128, 129, 130, 140, 141, 146, 149, 
152, 160, 162, 164, 167, 170, 273, 274, 299, 
308, 339, 341, 343, 352, 359, 363, 365 
definition, 18, 120 
Collage, 376-378 
Collage Theorem, 2, 94, 95, 97, 100, 101, 186, 
221, 275, 378, 380 
illustration, 96, 97, 98, 102, 103, 107, 108 
proof, 103 
for a recurrent iterated function system, 392-403 
Collage Theorem for Measures, 360, 361 
illustration, 361 
proof, 360-361 
compact, 20, 22, 23, 24, 41, 79, 92, 123, 124, 
125, 180, 184, 215, 243, 256, 258, 272, 287, 
288, 340, 342 
definition, 20 


compact metric, 146 
compact metric space, see metric space, com- 
pact 
compactness, 2, 15, 23, 29, 159, 172, 299 
completeness, 2, 15 
definition, 16 
see also metric space, complete 
Completeness of the Space of Fractals Theo- 
rem, 35-37 
proof, 36-37 
completion, of a measure, 344 
definition, 344 
computer programs 
Deterministic Algorithm, 85, 87, 210 
Escape Time Algorithm, 249-250, 253, 254, 
277 
see also Escape Time Algorithm, illustration 
hidden variable fractal interpolation function 
with Random Iteration Algorithm, 234-236 
invariant measure of an IFS with probabilities, 
370-372 
illustration, 373 
parameter space coloring, 311-312, 324 
Random Iteration Algorithm, 87—90, 216 
Random Iteration Algorithm with interpola- 
tion and vertical scaling factors, 213-215 
illustration, 215 
web diagram, 135-136 
condensation, 218, 276 
condensation set, 91-94 
definition, 91 
condensation transformation, definition, 91 
conformal, 7-8 
connected, 26, 27, 83, 257, 299, 302, 303, 313, 
314, 318, 326 
definition, 24 
illustration, 28 
connected, multiply, 26, 27 
definition, 26 
illustration, 28 
connected, pathwise, 26, 27, 238, 243, 256, 261 
definition, 24 
illustration, 109 
connected, simply, 26, 27 
definition, 26 
connectedness, 
see also connected; connection 
connection, illustration, Plate 16 
continuous, 40-41, 62, 79, 103, 104, 106, 110, 
Til, 124, 125,127, 128, 147,22 97, 221,357, 
268, 287-289, 347, 350, 363 
continuous, uniformly, 112-113 
definition, 40-41 
continuous deformation, definition, 26 
continuous function, 6, 229, 233, 237, 238, 242, 
245, 349, 356, 357, 370, 371 


definition, 14 
illustration, 8 

continuous transformation, 122, 288, 299 
illustration, 150 

Contraction Mapping Theorem, 75, 217 
proof, 73 

contraction mapping, 2, 4, 74-75, 77, 78, 79, 


80, 81, 83, 84, 91, 92, 95, 102, 104, 131, 217, 


218, 357 
definition, 74 
illustration, 76, 218 
contractive transformation, 76, 80, 236 
illustration, 76 
contractivity, 75, 79, 80, 81, 83, 84, 91, 93, 95, 
96, 97, 102, 111, 158, 163, 218 
contractivity factor, 232, 298, 300, 304 
definition, 74 
convergence, 2, 15, 16, 20, 124, 171, 268, 
277, 288 
definition 15, 16 
illustration, 39, 46 
convergent sequences, 34, 40, 83, 88, 276, 
353, 354, 356 
illustration, 82, 83 243, 244 
coordinate changes, 68-74, 77, 99-109 
illustration, 70, 71 
coordinate space, 68 
illustration, 70 
coordinate system, 68, 69, 71, 72, 176, 190, 
238, 294 
illustration, 69 | 
countable, 22, 84, 122, 125, 318 
definition, 120 
illustration, 124 
countable base, 340 
definition, 340 
critical point, 62, 64, 66, 67, 313 
definition, 47, 66 
illustration, 48 
critical value, 67 
definition, 48, 66 
illustration, 48 
cycle, 170-171, 274, 276 
definition, 132 
see also attractive cycle; repulsive cycle 
cylinder subset, 339 
definition, 339 


aS, definition, 24 
data, 192, 193-196, 205, 208, 211, 213, 215, 
218, 219, 222, 223-224, 233 
definition, 208 
generalized, 229, 232 
definition, 229 
illustration, 195 


deformation, bounded, 13-14 
deformation, of space, 47, 52-53, 179 
degree, of transformation, definition, 47 
5,, (B), definition, 348 
delta function, definition, 348-349 
dense, 167, 168, 170, 274 
definition, 165 
Desktop Fractal Design System, 399 
Deterministic Algorithm, 2, 86, 88, 219 
computer program for, 88-90, 210 


’ deterministic fractals, 379 


deterministic shift dynamical system, illustra- 
tion, 166 
Dy, (A), definition, 201 
dz; (, v), definition, 349 
diam(A), definition, 199 
diameter, of partition, definition, 347 
diffeomorphic, 148 
differentiable, 148, 318 
definition, 148 
disconnected, 27, 192, 319 
definition, 24 
disconnected, pathwise, 26 
definition, 24 
disconnected, totally, 29, 83, 115, 117, 127, 
140, 146, 148, 149, 154, 158, 164, 165, 169, 
170, 183, 186, 199, 206, 253, 299, 302, 303, 
319, 320, 326, 359 
definition, 24, 125 
illustration, 155 
disconnection, illustration, Plate 16 
dynamical system, 2, 81, 130-140, 141-142, 
149, 151-152, 246, 250, 252, 253, 255, 266, 
268, 271, 272, 276, 297, 309, 312, 313, 314, 
317, 318, 324, 326, 327 
definition, 130 
sensitive to initial conditions, definition, 167 
stretch, squeeze, and bend, illustration, 131, 
132 
transitive, definition, 167 
see also chaotic dynamical system; set dy- 
namical system; shift dynamical system; 
slide and fold dynamical system; stretch, 
squeeze, and bend dynamical system; 
transitive dynamical system 
dynamical systems, family of, illustration, 
Plates 17-20 
dynamics, on fractals, 2 


€ , definition, 5 
Elton’s ergodic theorem, 4, 364-370, 376 
€-net, 20 
definition, 20 
equivalence, 2, 3, 146, 147, 148, 164, 165, 
206, 287 
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see also metric space, equivalence; metrically 
equivalent 
Escape Time Algorithm, 3, 246-266, 271, 272, 
277, 282, 288, 289, 292, 293, 319, 324 
computer program, 249-250, 253, 254, 255, 
278 
illustration, 251, 253, 254, 259, 264, 263, 
264-265, 267, 268, 287, 290, 291, 320-324, 
325, 328, 329, Plates 6-14 
Euclidean geometry, see geometry, classical 
eventually periodic point, definition, 133 
see also periodic, eventually 
Extension Lemma, 34-35 
extension of a Cauchy subsequence, 34, 35 
Extension Theorem, Carathéodory’s, 4 


F, definition, 337—338 
f-1, definition, 42, 43 
f-1, (A) (for a set A), definition, 65 
family resemblances, 326 
Feigenbaum ratio, 314 
fern, 92, 170, 282 
IFS code for, 85, 90, 91 
illustration, 32, 39, 91, 166, 346, Plate 3 
fern, Black Spleenwort, 97, 126 
IFS code for, 99 
illustration, 102 
field, 338, 339, 342, 343 
definition, 337-338 
illustration, 339 
see also Borel field; sigma-field 
field, generated by &, definition, 338 
fixed point, 72, 73, 75, 76, 77, 78, 79, 81, 97, 
102, 103, 127, 152, 218, 268, 272, 280, 
288, 352 
definition, 71 
table, 103 
see also attractive fixed point; repulsive 
fixed point 
folds, 46-47 
illustration, 50 
fon» definition, 43 
fractal dimension, 3, 15, 146, 171, 206, 207, 
212, 223, 224, 224-226, 233, 247, 252, 253, 
266, 278, 302, 303, 308 
calculating, 173-175 
definition, 171, 173, 180-181 
experimental determination of, 188-195, 227 
illustration, 188, 190, 191, 192, 193, 194, 
196, 197 
table, 189, 191 
idea behind, 172-173 
illustration, 172, 176, 177, 178, 179, 183, 185, 
186, 187, 201, 202, 203, 204, 231 


of fractal interpolation functions, 223, 226 
fractal function, hidden variable, 240-243 
fractal interpolation function, 3-4, 207, 218, 

239, dbey Cady 200, cal 

definition, 218 

generalized, 233, 237 
definition, 233 
illustration, 239 

hidden variable, 234, 236, 237-238 
computer program for, 234-236 

illustration, 208, 209, 212, 220, 221, 222, 223, 
224, 225, 228-229, 230, 231 

fractal interpolation, 3 

hidden variable, illustration, 237 
fractal systems, 379, 383 
fractal tree, 2, 93 
fractal, deterministic, definition, 79 
fractals, algorithms for computing, 2, 84-91 

see also “Chaos Game”; Deterministic 

Algorithm; Random Iteration Algorithm 
function 

characteristic, definition, 344 

simple 

definition, 344 
see also simple function 
see also specific functions 


generalized data, definition, 229 
see also data, generalized 
geodesics, definition, 10 
geometric series, 46 
illustration, 46 
geometry, classical, 1, 205, 206, 207 
illustration, 114, 206 
geometry, deterministic, 4, 42, 236, 373 
graph, 233, 234, 237, 240, 243, 252, 272, 345 
definition, 229 
group, 73, 74 
definition, 73 


H(X), definition, 27 
illustration, 30 
Hausdorff distance, 45, 79, 95, 207, 289, 297 
definition, 32 
illustration, 32 
Hausdorff metric, 33, 79, 80, 85, 95, 111, 207 
219, 268, 289, 326, 359 
illustration, 112 
Hausdorff p-dimensional measure, 201 
definition, 200 
illustration, 201 
Hausdorff-Besicovitch, 3 
Hausdorff-Besicovitch dimension, 200, 202, 


203, 204, 205, 295 
definition, 202 
illustration, 205 
hidden variable, 234-235 
definition, 234—235 
see also fractal interpolation function, 
hidden variable 
homeomorphic, 14, 145-146, 299 
definition, 145-146 
homeomorphism, 15, 41, 145, 146, 148, 152, 
165, 299 
definition, 14 
illustration, 29, 147 
Hutchinson metric, 349, 351, 406 
definition, 349 


IFS, 2, 81, 83, 84, 85, 87, 88, 90, 91, 93, 
94-101, 136, 140-145, 146-148, 149, 165, 
169, 182-183, 203, 207, 210, 211-213, 214, 
215-216, 218, 223, 226, 232, 234, 236, 
240-243, 245, 246, 253, 266-267, 270, 278, 
280, 282, 304-306, 352 

definition, 81 
hyperbolic, 149-152, 159-161, 168, 170, 183, 
185, 186, 199, 202, 209, 210-211, 215-216, 
218, 219, 232, 243, 252, 254, 272, 274, 298, 
299, 301, 304, 313-314, 318, 350, 352, 
360-363 
hyperbolic with condensation, 92-94, 119 
definition, 91 
illustration, 96, 98, 100, 112, 114, 187, 231 
lifted, 154, 168 
definition, 154 
Random Iteration Algorithm applied to 
computer program, 213-214, 234-236, 238 
with condensation, illustration, 92, 93, 94 
with probabilities, 331, 332, 335, 343, 345, 
351, 354, 359, 360, 364 
illustration, 362, 363, 364 
invariant measure of, 352 
computer program, 370-372 
illustration, 373 
see also iterated function system 
IFS code, 219, 376, 377, 378 
for a measure on §M, 331, 332, 334 
illustration, 332, 333, 334 
for Black Spleenwort, 99 
for Castle fractal, 186 
for Fern, 86, 90, 102 
for Sierpinski triangle, 85, 89-90 
for M, 238, 240, 243 
for Square, 86 
for Tree, Fractal, 86 
image animation, 111 
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Inf S, definition, 33, 74-75 
infinum, definition, 33, 75 
Sxf dw, definition, 348 
integral, 348 
definition, 345, 348 
of a continuous real-valued function, 4 
interior, 25 
definition, 23 
illustration, 24, 25 
interpolation function, 209, 210, 229, 233 
definition, 208, 229 
see also piecewise linear interpolation 
function 
interpolation points, 218, 223, 229 
definition, 209, 229 
interval [a, b], definition, 9 
invariant, 15, 165, 280, 288 
illustration, 16 
invariant measure, 360-363, 365, 366, 372, 376 
definition, 334-335 
illustration, 374, 375 
of IFS with probabilities, 359, 360, 361, 366 
definition, 352 
illustration, 363, 364 
invertible, 14, 49, 58, 68, 149, 150, 153-154, 
158, 184, 217, 299 
definition, 42-43 
invertible transformation, 65, 73, 159-160 
iterated function system, 2, 330 
definition, 80 
with probabilities, 330-331 
definition, 330 
see also IFS, with probabilities 
see also IFS 
iterates, 47 
backward, definition, 43 
forward, definition, 43 


jet exhaust, 3, 205-206, 207 
see also jet flame 
jet flame, 191-193 
illustration, 193, 194 
Julia set, 3, 61, 256, 258, 259-263, 266, 267, 
270, 271, 272, 273, 274, 275, 276, 278, 280, 
281, 282, 287, 297, 310, 312, 313, 314, 317 
definition, 256, 275 
hyperbolic, 271 
illustration, 269, 275, 277, 279, 283-286 
318, Plates 13, 16 
Julia set, filled, 255, 258, 259, 260, 263, 266, 
271, 272, 289 
definition, 255-256 
illustration, 259, 260, 261, 262, 315, 316 
just-touching, 116, 117, 125, 126, 142, 149, 
151, 152, 157, 161, 165, 183, 184, 185, 199, 
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200, 211, 224, 238, 263, 274, 319, 320 
definition, 125 
illustration, 151 
just-touching collage, 103-106, 113-114 
illustration, 105 


landscape, 377 
illustration, Plates 21, 22 

Leaf and Sunflower, condensation sets, 
illustration, Plate 29 

leaf, 52, 53, 65, 97, 99, 103, 155, 377 
illustration, 52, 96, 103, 336, Plates 23, 24 

Lebesgue measure, definition, 344 

lily, 324 

Lim inf, 177 

limit, 16, 18 
definition, 16 

limit point, 19 
definition, 19 
illustration, 18 

Lim sup, 177, 181, 182 

linear fractional transformation, see Mobius 

transformation 

linear space, 5, 6 

linear transformation, 51, 54, 57, 78 
illustration, 51, 52, 78 

local iterated function systems, 380 


ol, definition, 315 
alt (A, p, €), definition, 198 
elt (A, p), definition, 198 
Mandelbrot, B., 1 
Mandelbrot set, 4, 301, 302, 303, 304, 305, 
306, 312, 313, 314, 317, 318, 324, 327 
definition, 301, 312 
illustration, 301, 303, 304, 305, 306, 307, 
308, 309, 310, 313, 315, 316, 317, 318, 319, 
329, Plate 16 
map, 3, 50, 83, 97, 128, 128, 141, 152, 154, 
165, 183, 186, 200, 203, 207, 209-211, 
212, 233, 240, 243, 263, 268, 271, 272, 274, 
278, 288, 289, 294-299, 301, 302, 308, 309, 
310, 317, 319, 324, 326, 361, 363, 376 
illustration, 56, 57, 59, 60, 65, 295, 296 
Plates 12, 14, 15 
mapping, 43, 58, 59, 62, 63, 64, 147, 317 
see also contraction mapping 
Markov chain, 392~393 
Markov operator, 350, 351, 352, 354, 360, 
406-409 
definition, 350 
illustration, 353, 354, 355, 356, 357, 358, 359 


measure, 4, 332, 335, 336, 341, 342, 343, 344, 
345, 347, 350, 351, 352 
definition, 341 
Mercator’s projection, 24, 278 
illustration, 280 
metric, 11, 14, 29, 78, 99, 215, 218, 229, 232, 
272, 344, 349 
compact, 146 
definition, 10 
equivalent, 12, 215-216, 273 
definition, 12 
Euclidean, 11, 12, 14, 45, 58, 77, 173, 
175-176, 180, 195, 198-200, 210, 215-216, 
219, 246, 256 
illustration, 18 
illustration, 11, 13, 18 
Manhattan, 11, 12, 58, 77, 161, 180, 207 
metric equivalence, 24 
metric space, 2, 3, 10-41, 43, 68, 77, 79, 81, 
92, 111, 147, 148, 165, 167, 171, 172, 173, 
174, 177, 180, 181, 195, 198, 199, 213, 217, 
219, 238, 288, 297, 341, 342, 343, 344, 345, 
348, 349 
closedness of, 2 
see also closed; closedness 
compact, 4, 38, 75, 77, 78, 84, 122, 146, 272, 
299, 340, 344, 347, 348, 349, 350, 351, 
352, 359 
illustration, 76 
compactness of, 2, 77 
see also compact; compactness 
complete, 17, 20, 27, 31, 34, 75, 80, 102, 104, 
111, 122, 123, 172, 180, 216, 229, 232, 237 
completeness of, 2 
see also completeness 
connectedness of, 2 
see also connected; connectedness 
convergence of, 2 
see also convergence 
definition, 15 
equivalence of, 2, 15 
illustration, 21, 29 
equivalent, 12, 13, 14, 18, 19, 21, 22, 
126-129, 145-147 
illustration, Plate 1 
openness of, 2 
see also open; openness 
subsets of, 2, 15 
transformation on, 2 
see also transformation 
metrically equivalent, 145, 146, 165, 171, 
177, 200, 300 
definition, 145-146 
illustration, 16 
minimal period, definition, 132 
Mobius transformation, 2, 49, 50, 58-61, 72, 


74, 75, 83-84, 131, 287 
definition, 49, 58 
illustration, 59, 60, 61, 133, 134 
model, 29, 159, 173, 190, 287, 312, 347, 373, 
376 
Monterey Coast, 377 
illustration, Plate 26 
mountain, illustration, 106 
Myreberg point, 314 


MN (a, €), definition, 172 

natural scene, 374 

nested, 43-44, 279 

Newton transformation, 280, 281, 282, 317 
definition, 276 
illustration, 287, Plate 14 

Newton’s method, 4, 276-287 

Non-Wavering Domain Theorem, 278 

norm, of a point, definition, 58 

norm, of the linear transformation, 
definition, 58 

numits, 249, 250, 252, 254, 320, 331, 370 
definition, 248 


one-parameter family, 130, 136, 149, 256, 272 
illustration, 230, 231 
one-to-one, 14, 43, 68, 140, 146, 157 
definition, 42-43 
onto, 14, 43, 68, 120, 124, 125 
definition, 42-43 
open set condition definition, 125 
open set, 62, 268, 289, 290 
open, 21, 23, 257, 288, 340 
definition, 20 
openness, 2, 15 
orbit, 2, 130, 131, 132, 134, 135, 136, 140, 153, 
156, 157, 158-163, 165, 167-169, 247, 255, 
257, 258, 268, 276, 277, 278, 287, 288, 310, 
313 
chaotic, 164-165, 169 
definition, 130 
illustration, 141, 142, 143, 144, 165, 166, 248 
outer measure, definition, 341 
overlapping 116-117, 119, 125, 126, 149, 151, 
152, 153, 155, 162, 183, 186 
definition, 116-118, 125 
illustration, 121, 152, 155 


+ (for set and real numbers), definition, 33 
parameter space, 3-4, 296, 297, 298, 299, 301, 
304, 306, 308, 310, 314, 317, 320, 326, 327 
illustration, 297, 301, Plates 17-20 


parameter space coloring, computer program 
for, 311-312, 314 
parameters, 23, 104, 106, 212, 233, 234, 271, 
272, 300, 301, 305, 306, 314 
continuous dependence of fractals on, 2, 
101-114 
continuous dependence of invariant measure 
on, illustration, Plate 25 
partition, definition, 347 
partition, diameter of, definition, 347 
path, 24, 26 
definition, 24 
illustration, 26 
perfect, 19, 44, 83 
perfection, 15 
definition, 19 
period, definition, 128 
periodic address, definition, 128 
periodic point, 129, 132, 274 
definition, 128, 132 
see also attractive periodic point; repulsive 
periodic point 
periodic, eventually, 133, 136, 141-142, 147, 
152 
definition, 129 
p; & L, definition, 360 
Photocopy Machine Algorithm, 382 
illustration Plates 37, 40 
piecewise linear interpolation function, 210, 
212, 246 
illustration, 210 
point x, illustration, 6, 7, 18 
Point at Infinity, 255, 257, 258, 268, 272, 
310, 313 
polynomial transformation, 46, 49, 67, 258, 270 
definition, 46 
illustration, 47, 48, 50 


quadratic transformation, 61, 62, 64-65, 67, 74, 
253, 257, 266, 280 
illustration, 63, 64, 65 
quadree 
definition, 118 
illustration, 119 


R, definition, 54 

R, definition, 5 

Rg, definition, 54 

Random Iteration Algorithm, 84, 85, 87-88, 
89-90, 130, 165, 168, 169, 213, 267, 271, 
272, 330, 332, 334, 335, 347, 352, 365-366, 
376 

illustration, 113, 267, 268, 269, 332, 333, 334, 
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33$7336,;337 
Random Iteration Algorithm, computer pro- 
gram, with interpolation and vertical 
scaling factors, 213-214 
Random Iteration Algorithm, computer pro- 
gram, 89-90, 210 
random shift dynamical system, 149, 153, 156, 
158, 159, 160, 161, 168, 167-168 
definition, 153 
illustration, 153, 163 
rational numbers, 167 
definition, 122 
rational] transformation, 66, 271 
recurrent iterated function systems, 379 
Chaos Game for, 385-388 
Collage Theorem, 392-403 
Photocopy Machine Algorithm for, 388-392 
reflection, 59 
definition, 54 
repelling sets, 3 
repulsive cycle, 2, 141-142, 275 
definition, 132-133 
repulsive fixed point, 132, 141, 147, 247 
definition, 132-133 
illustration, 133-148 
repulsive periodic point, 132-133, 278 
Riemann sphere, 8, 43, 255, 268, 272, 278 
illustration, 8, 256, 270, 273, Plate 14 
Mobius transformations on, 58, 61, 64-65 
rotation, definition, 54 


So, definition, 23 
scaling factor, 183, 184, 186, 199, 238, 263, 
302, 318 
definition, 54 
illustration, 56 
see also vertical scaling factor 
semigroup, 73, 74, 77 
definition, 73 
sensitive to initial conditions, definition, 167 
sequence, of intervals, 45 
illustration, 45, 46 
set dynamical systems, definition, 136 
set-valued inverse, definition, 65 
Seurat, 376-378 
Shadow Theorem, 2-3, 154, 156, 157, 158 
illustration, 156, 157 
Shadowing Theorem, 3, 159-160, 161, 162, 169 
illustration, 162, 163, 164 
proof, 160 
shear transformation, 212, 213-215 
definition, 212 
illustration, 212, 226 


shift dynamical system, 2, 140, 142, 146-147, 
148, 149-158, 159, 160, 161, 162, 163, 164, 
165, 167, 168, 169-170, 246, 252, 254, 273, 
274, 276, 319, 327 

definition, 140 

illustration, 141, 151, 152, 153, 247 

lifted, definition, 154 

see also deterministic shift dynamical 
system; random shift dynamical system 

shift operator, definition, 74 

shift transformation, definition, 140 

Sierpinski space A, definition, 9 

Sierpinski triangle, 9, 38, 49, 54, 62, 65, 77, 84, 
87, 91, 115-116, 118, 119-120, 122, 149, 
152, 161, 176, 184, 198, 200, 243, 246-249, 
250, 252, 259, 289, 332, 336, 341, 345-347 

IFS code for, 84-85, 88-90 

illustration, 18, 50, 63, 65, 88, 89, 117, 150, 
177, 199, 245, 247, 248, 335, 346 

table, 342 

>, definition, 8, 122 

o-field, definition, 340 

sigma-field, 340-341, 342, 343, 345 

sigma-field, generated by G, definition, 340 

similitude, 54, 58, 59, 60, 62, 65-66, 67, 68, 74, 
75, 97, 98, 99, 113, 183, 186, 199, 200, 238, 
263, 302, 318, 326 

definition, 54 
illustration, 55, 63, 96, 98, 103, 186 
simple function, 344, 347, 348, 350 
definition, 344 
illustration, 346 
slide and fold dynamical system, 136 
illustration, 138 
space X, 5—10 
definition, 5 
points of, 5 
space, see metric space 
space-filling curves, 238-245 
illustration, 241, 242, 244, 245 
IFS code for, 238, 240, 243 

Square, IFS code for, 86 

strange attractor, see attractor, strange 

stretch, squeeze, and bend dynamical system 
illustration, 131, 132 

Sunflower Field, 377, 378 

illustration, Plate 30 
Sup S, definition, 33, 75 
support, of a measure, 344 
definition, 344 
supremum, 200 
definition, 33, 75 


topologically conjugate, 147 
definition, 146 
topologically equivalent, 146, 152 
definition, 145 
topology, 13, 15 
illustration, 13 
totally disconnected, 115 
definition, 125 
see also disconnected, totally 
transformation, 2, 29, 42-114 
affine, 2, 42, 43, 45 
see also affine transformation 
contractive, 75 
definition, 42-43 
illustration, 21 
invertible, 43 
Mobius, see Mobius transformation 
rational, 42 
real line, 42-49 
see also affine; analytic; condensation; 
continuous; invertible; Mobius; Newton; 
one-to-one; onto; polynomial; quadratic; 
rational; and shear transformations 
transitive, 167 
definition, 167 
translation, 45, 51 
illustration, 52 
tree, 38, 93 
illustration, 92 


Tree, fractal, IFS code for, 86 
illustration, 93 

turbulent jet, see jet flame 

Twin-Dragon Fractal, 306 
illustration, 310 


uncountable, 120 
definition, 120 


Vector Recurrent Iterated Function System 
(VRIFS™), 408-409 
illustration, 409, Plates 37, 38, 39 
vector space, see linear space 
vertical scaling factor, 212~—215, 223, 227 
vertical scaling factor, definition, 212 


web diagram, 136, 148, 163-165, 260 
computer program, 135-136 
definition, 135 
illustration, 134, 137, 148, 163, 166 


X, definition, 8 
x, definition, 71 
\X|, definition, 
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